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Abstract: We carry out simple analytical calculations and Monte Carlo studies to better 
understand the impact of QCD radiation on some well-known jet substructure methods for 
jets arising from the decay of boosted Higgs bosons. Understanding differences between 
taggers for these signal jets assumes particular significance in situations where they perform 
similarly on QCD background jets. As an explicit example of this we compare the Y-splitter 
method to the more recently proposed Y-pruning technique. We demonstrate how the insight 
we gain can be used to significantly improve the performance of Y-splitter by combining it 
with trimming and show that this combination outperforms the other taggers studied here, 
at high pr- We also make analytical estimates for optimal parameter values, for a range of 
methods and compare to results from Monte Carlo studies. 
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1 Introduction 


In recent years the detailed study and analysis of the internal structure of hadron jets has 
become an area of very active investigation. The principle reason for such high interest has 
been in the context of Higgs boson and new physics searches at the LHC and associated 
phenomenology. Due to the large (TeV scale) transverse momenta that can be accessed at 
the LHC, electroweak scale particles, such as the Higgs boson, can be directly produced with 
large boosts. Alternatively the decay of yet undiscovered heavy new particles to comparatively 
light standard model particles, such as top quarks or W/Z bosons, would also result in the 
production of boosted particles whose decay products would consequently be collimated. 
This in turn means that rather than producing multiple resolved jets, a significant fraction 
of the time the decay products are encompassed in a single fat jet. Understanding the 
substructure of such jets therefore becomes crucial in the context of discriminating between 
jets originating from QCD background and those originating from signal processes involving 
e.g. Higgs production and its hadronic decay. 

Though pioneering studies were carried out by Seymour several years ago [1], and the 
Y-splitter method for tagging W bosons was subsequently introduced in Ref. [2] over a decade 
ago, the revival of interest in jet substructure is relatively recent and owes essentially to 
seminal work by Butterworth, Davison, Rubin and Salam [3]. These authors revealed the 
power of substructure analyses by studying the discovery potential for a light Higgs boson 
( Mh ~ 120 GeV) in the process pp —> W/Z,H with Higgs decay to bb. They demonstrated 
that exploiting the boosted regime and applying jet substructure methods, specifically a 
mass-drop and filtering analysis, was sufficient to turn what was previously regarded as an 
unpromising channel into one of the best channels for Higgs discovery at the LHC. Several 
other applications followed, dedicated to new physics searches as well as top and W/Z boson 
tagging, and numerous substructure techniques are now in existence and being commonly 
employed in experimental analyses both in the context of QCD measurements as well as for 
searches [4-14], For the original articles introducing a selection of some of these methods we 
refer the reader to Refs. [15-25] while comprehensive reviews of the held and further studies 
are available in Refs. [26-29]. 

Most recently research has started to emerge [30-33] which aims at enhancing our un¬ 
derstanding of jet substructure methods via the use of analytical calculations that, where 
possible, lend greater insight and provide powerful complementary information to that avail¬ 
able purely from traditional Monte Carlo (MC) based investigations of jet substructure. In 
Ref. [31] in particular, resummed results were provided for jet mass distributions for QCD 
background jets after the application of a variety of jet substructure methods (that we shall 
collectively refer to as ‘taggers’) and detailed comparisons to MC studies were carried out. 
Jet mass distributions were examined for the case of trimmed [21] and pruned jets [19, 20] as 
well as for jets obtained after the application of the mass-drop tagger [3]. 

One of the main aims of Ref. [31] was to better understand how aspects of tagger defi¬ 
nition and design may interplay with QCD dynamics to dictate the performance of taggers 
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as reflected by their action on background jets. The improved analytical understanding that 
was achieved led to a better appreciation of the role of tagger parameters (including the dis¬ 
covery of the apparent redundancy of the mass-drop parameter /x in the mass-drop tagger 
[3]). The analytical studies also paved the way for improvement of theoretical properties of 
taggers. Examples of improvements that were suggested or made in Ref. [31] included the 
design of taggers with a perturbative expansion more amenable to resummation as for the 
modified mass-drop (mMDT) as well as removing undesirable tagger features as for the case 
of pruning via the Y-pruning modification. Subsequent work has also demonstrated how an 
analytical understanding of the action of taggers on QCD background can be exploited to 
construct valuable new tools such as the soft drop technique introduced in Ref. [33]. 

While thus far there has been heavy focus on taggers applied to QCD background, until 
now radiative effects for signal jets have not been investigated in the same level of analytical 
detail for many commonly used substructure methods. Detailed analytical calculations have 
however been performed to study the action of filtering for H —>• bb [34] and for N-subjettiness 
[35], while the role of QCD radiation in the context of template tagging was discussed in 
Ref. [23]. 

We first observe that it is common to study high jjt signal jets in some relatively narrow 
mass window of width ~ 5AI around the mass, M of some boosted decaying heavy resonance 
of interest, this mass cut being a first step in tagging signal jets. One then has a situation 
where there are various disparate scales involved in the problem such as the (potentially) 
TeV scale transverse momenta of the fat jets, the mass M of the resonance (which for our 
studies we can consider to be around the electroweak scale) and the width of the window 
SM which for most purposes we can consider as a parameter ~ 10 GeV. These scales are in 
addition, of course, to the various parameters corresponding to angular distances and energy 
cuts introduced by tagging and jet finding. 

It is well known that in such multi-scale problems radiative corrections have the potential 
to produce large logarithms involving ratios of disparate scales. In the example of filtering 
studied in Ref. [34] large logarithms in Mh/5M arose from considering soft emissions, which 
were accompanied by collinear logarithmic enhancements in R^/R^n, the ratio of the bb 
opening angle to the filtering radius. On the other hand Ref. [31] observed via MC studies 
of the signal that for the taggers studied there (mMDT, pruning, trimming,Y-pruning) the 
tagger performance was primarily driven by the action of taggers on QCD background, with 
signals not appearing to display very sizeable radiative corrections for the default parameters 
chosen there. 

In order to better understand these apparently contrasting observations it is desirable to 
acquire a higher level of analytical insight into the action of taggers on signal jets. When 
comparing the performance of taggers one may also meet a situation where two taggers shall 
act essentially similarly on background jets and hence their action on signal becomes of critical 
significance. We shall in fact provide an explicit example of this situation later in this article. 
It is also of importance to understand and assess the impact of QCD radiative corrections 
and non-perturbative effects on tagger efficiency for signals, to ascertain what theoretical 
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tools (fixed-order calculations, MC methods, resummed calculations or combinations thereof), 
should ideally be deployed to get the most reliable picture for the signal efficiency for a given 
tagger. 

With all the above aims in mind, here we embark on a more detailed study dedicated to 
signal jets. We shall focus our attention on the case of a jet arising from a boosted Higgs boson 
for a process such as Higgs production in association with a vector boson pp —»• W/Z, H, 
with H —y bb, where we will work in a narrow width approximation. For our analytical 
approximations we shall also typically consider highly boosted configurations i.e. those where 
the Higgs has transverse momentum px 3> Mu and shall further take a fat jet with radius 
R ^$> We shall work in a small-angle approximation throughout, though we will often 
consider R rsj \ # 

We stress here that we do not intend to provide precise high-order calculations for ra¬ 
diative corrections to any given process but seek mainly to understand and compare the 
behaviour of taggers via a combination of approximate analytics and MC cross-checks. For 
an example of exact fixed-order calculations involving jet substructure and signal processes 
we refer the reader to Ref. [36]. 

We start in the next section by analysing the case of a plain jet mass cut focussing on a 
mass window around the signal mass and deeming a jet to be tagged as signal if the jet mass 
falls within this window. We consider the impact on signal efficiency of initial state radiation 
(ISR), final state radiative corrections (FSR) both analytically and in MC studies. We also 
study the impact of non-perturbative effects (NP) with MC. The results so obtained can then 
provide a point of reference and comparison to judge the improvements that are offered by 
use of substructure taggers, which impose requirements in addition to a simple cut on mass. 

Next we move to analysing jets with application of various taggers. In section 3 we study 
trimming at lowest order and with ISR and FSR corrections. We investigate the logarithmic 
structure that emerges in M, 5M and px as well as in tagger parameters and compare to MC 
results where appropriate. We also study non-perturbative corrections, though purely with 
MC results. In section 4 we analyse along similar lines pruning and the modified mass-drop 
tagger (mMDT). FSR is analysed further for these taggers in appendices A and B where we 
also compare parton shower results to those from full leading-order calculations i.e. those 
that go beyond the soft/collinear approximation. 

In section 5 we study Y-pruning and the Y-splitter tagger [2], We observe that while 
the action of Y-splitter on QCD background is similar to Y-pruning, the signal jet with 
Y-splitter is subject to severe loss of resolution due to ISR and underlying event (UE) effects. 
We show with MC studies that combining Y-splitter with trimming dramatically improves 
the signal behaviour while leaving the background largely unmodified. As a consequence we 
show that Y-splitter with trimming outperforms the other taggers we study here, especially 
at high px■ To our mind this example further illustrates how even a relatively basic analytical 
understanding of all aspects of taggers (for both signal and background) can be exploited to 
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achieve important performance gains. 1 

Finally we carry out analytical studies of optimal values for tagger parameters, obtained 
by maximising signal significance, and compare to MC results. We conclude with a summary 
and mention prospects for future work. 


2 Results for plain jet mass 


Here we shall consider the plain jet-mass distribution for fat signal jets without the application 
of substructure methods, other than the imposition of a mass window 8M, as previously 
stated. As also mentioned before, we shall consider the case of Higgs boson production 
in association with an electroweak vector boson pp —> W/Z,H with Higgs decay to a bb 
quark pair and shall work in a narrow width approximation throughout. For the purposes 
of examining the jet substructure we shall not need to write down matrix elements for the 
production of the high px Higgs boson and shall instead be concerned purely with the details 
of the Higgs decay and the resulting fat jet, as well as the impact of ISR, FSR and non- 
perturbative effects. Let us take a boosted Higgs boson produced with transverse momentum 
Pt Mh and purely for convenience set it to be at zero rapidity with respect to the beam 
direction, so that the corresponding energy is \Jp\ + Mfj. We further consider Higgs decay 
into bb so that in terms of four-momenta one has pn = Pb + Pi- Thus the invariant mass of 
the Higgs can be expressed as 

Ph = m h = 2 Pb ■ Pi = 2 ~(1 - z)(P t + m h) (! - cos ^bb) > ( 2 -l) 


with z and 1 — z the energy fractions of the decay products and 9 h i the bb opening angle, 
where we neglected the b quark masses. Furthermore we shall consider the highly boosted 
regime taking A = -C R 2 and shall systematically neglect power corrections in A. Then 

P t 

from Eq. (2.1), taking a small-angle approximation, we obtain the standard result: 




A 


z(l-z)' 


( 2 . 2 ) 


Requiring that the Higgs decay products are contained in a single fat jet, 6R < thus 
translates into a constraint onz: 

z {l~ z )> w (2.3) 

Let us start by providing the results for the signal efficiency, £g°^ to lowest order i.e. 
taking just the H —> bb decay without any radiative corrections. This can be considered as 
the fraction of decays that are reconstructed inside a fat jet of radius R. Here one has to 
consider the relevant Feynman amplitude for H —> bb and the full decay phase-space with an 

1 One other recent example of simple analytical arguments, based on power counting, being used to good 

effect for top tagging can be found in Ref. [37]. 
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integral over the final state parton momenta. However for the fraction of decays inside the 
fat jet, £g°\ we just obtain 


-( 0 ) _ 
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(2.4) 


which is trivially in good agreement with corresponding MC event generator results with 
all ISR, FSR and non-perturbative effects turned off. The above result simply suggests, 
as one can easily anticipate, that with increasing boosts, i.e. smaller A, the efficiency of 
reconstruction inside a single jet increases. At this lowest-order level there is of course no 
role for the mass-window SM since the jet mass Mj coincides with the Higgs mass Mh- 


2.1 Initial state radiation 

Let us now account for the impact of initial state radiation on the jet mass distribution. 
We can anticipate that the impact of soft radiation may be significant here because of the 
fact that we require the invariant mass of the fat jet to be within SM of the Higgs mass, 
with SM < pt- This requirement imposes a constraint on real emissions, arising from ISR, 
that enter the jet since these contribute directly to the deviation of the jet mass from Mh- 
Hence one can expect that large logarithmic corrections arise as a consequence. In order to 
understand the structure of the logarithmic corrections that arise, we consider the process 
pp —> ZH with the additional production of soft gluons radiated by the incoming hard partons 
(qq pair). Let us start by taking a single ISR gluon which is soft i.e. has energy u <C pt- In 
the soft limit we can work with the eikonal approximation in which production of the ISR 
factorises from the Born-level hard process pp —> ZH. To compute the signal efficiency we 
shall require the jet invariant mass to be within a relatively narrow mass-window SM of the 
Higgs mass: 

M h - SM < Mj < M h + SM. (2.5) 

As we observed previously at lowest order (Born level) this inequality is always true since 
Mj = Mh, however with ISR it amounts to a constraint on the ISR gluon energy. Neglecting 
corrections of order SM 2 we can write: 


M 2 — Mj] = 2 pH • k < 2MhSM, 


( 2 . 6 ) 


where pn is the four-momentum of the Higgs (or equivalently the sum of the four-momenta 
of its decay products) and k that of the ISR particle. Defining 8 as the angle between the 
soft emission and the Higgs direction we can write: 

2 pH • k = 2u; ^ \jP t + Mh ~ Pt cos ■ (2-7) 

Since we take A = Mjj/pj- <C 1 and also use the small 8 approximation, we can expand 
in small quantities and write the following constraint on gluon energy: 


2 M h SM 

p T {8 2 + A)' 


( 2 . 8 ) 


- 6 - 



One can express this equation in terms of standard hadron collider variables kt , p and 
defined wrt the beam direction by noting simply that lo = kt cosh p and 6 2 ~ p 2 + q i 2 . 

We wish to examine only the leading logarithmic structure that arises from soft ISR 
emissions, starting with a single emission i.e. to leading order in the strong coupling. Since 
we are considering an emission that enters the high px fat jet we are concerned with large- 
angle radiation from the incoming hard partons. This in turn implies that there are no 
collinear enhancements associated to such radiation and the resulting leading logarithmic 
structure ought to be single-logarithmic, arising purely from the infrared singularities in the 
gluon emission probability. 2 

The gluon emission probability is in turn given by the standard two particle antenna for 
soft emissions off the incoming quark/anti-quark: 


Wij = 2C F - 

7 r 


{pi ■pj) 

C Pi ■ k)(pj ■ k )' 


(2.9) 


where Cf = 4/3. Analogous to the case of jet mass distributions for hadron collider QCD jets 
(see for example calculations in Ref. [38]) the ISR contribution can be written by integrating 
Eq. (2.9) over the gluon emission phase space. The result can be expressed in terms of kt, p 
and </> and reads: 


e (1) 

e S,ISR 


dzQ 



z) 



x 2 Cf 



d(j> a s (k t ) 

27T 7r 



2 M h 6M 
Pt (0 2 + A) 


kt cosh p 



( 2 . 10 ) 


The above equation contains an integral over the energy fraction of the partonic offspring 
involved in the Higgs decay (i.e. that over z), with a constraint that is identical to the zeroth 
order requirement that the hard quarks be contained in the fat jet, which is unmodified by 
the presence of soft ISR at leading logarithmic level, i.e. in the limit oj = kt coshr? <C px- 
The step function involving a restriction on the transverse momentum kt follows directly from 
Eq. (2.8) and the subsequent arguments. Virtual corrections are incorporated via unitarity 
through the —1 term also in parenthesis. Lastly we have a factor 0j e t that is the condition 
that the soft ISR is within the fat jet. 

The clustering condition ©j e t is in principle quite complicated since it involves recom¬ 
bination of three particles within the fat jet, namely the b, b and the ISR gluon. In our 
approximation of A <C R 2 i.e. in the limit of large boosts, we are considering a highly colli¬ 
mated quark pair, relative to the radius of the fat jet. One can thus ignore the effect of the 
finite bb opening angle as these effects contribute only terms that are relatively suppressed by 
powers of A compared to the leading term we compute. Then one only has to consider the 

J In practice this expectation is challenged by the discovery of superleading logs [39]. Since these appear 
at order a* and are suppressed as 1 /Nq we do not expect them to make a significant impact on the essential 
arguments we make here. 


-7 - 






fact that the soft ISR gluon is in the interior of the fat jet which amounts to the condition 
©jet = © {R 2 — (y 2 + , since we had taken the Higgs rapidity as zero. 

Within the context of the current purely order a s calculation we shall also consider the 
coupling as fixed at scale px and ignore its running. Running coupling effects are of course 
important to include for leading logarithmic resummation and we shall do so for our final 
answers. We define £s,isr = £g°^ + e siSR an d carrying out the relevant integrations with fixed 
coupling we get from Eq. (2.10) the leading logarithmic result: 

£S,ISR 


do) 




7r 


( PfR 2 \ 
\2M h 5M} ' 


( 2 . 11 ) 


In order to obtain the above result starting from Eq. (2.10) one can first integrate over 
kt, discarding the coshr/ accompanying factor as this will only generate subleading terms. 
The integral over kt produces the large logarithm we seek. One can then express the 77 , 0 
integral as one wrt 9 2 = p 2 + q i 2 and then integrate over 9 2 with the condition 9 2 < R 2 . With 
neglect of subleading terms, including those which vanish with A, one then obtains the result 
reported above for the quantity £s,isr/ £ s°^' We have however chosen to retain the formally 
subleading logarithmic dependence on R which can become important at smaller values of R, 
e.g. R ~ 0.4. 

Given the large single logarithms that emerge from the above approximate fixed-order 
calculation, it is natural to wish to attempt to resum at least the leading logarithms to all 
perturbative orders. This is far from a straightforward exercise. One of the main obstacles 
to performing a soft single log resummation, in the present context, is the presence of non- 
global logarithms, associated clustering logarithms [40-43] as well as super leading logarithms 
referred to previously. Such calculations pose a serious challenge to the current state of the 
art and are beyond the scope of our work. 

In the absence of a complete resummed calculation one can still obtain a working estimate 
that can be compared to MC results, simply by exponentiating the order a s result obtained 
above and by including running coupling effects. 

The exponentiated result including the running of the QCD coupling is given by 

£ P 

-^«exp(-2 C F R 2 t) (2.12) 

e s 


where we defined the single-log evolution variable 




dk t 

h 


u s {h) 


1 

47t/3 0 


In 


1 


1 - 2A’ 


A 


/ %ot s {PT ) In 


p 2 T R 2 
2 M h 5M' 


(2.13) 


where /3o = (HG 4 — 2 rif), and we shall use nj = 5. Note that we have indicated the 
exponentiated result by the superfix P, which indicates the resummed contribution from pri¬ 
mary emissions alone i.e. excluding secondary emissions which lead to non-global logarithms. 










We observe here that the perturbative calculations break down at A = 1/2 which corresponds 
to an ISR emission with kt ~ Aqcd 5 the QCD scale. This translates into a value of 5M which 
is 

"Hf (2 ' 14) 

where dMjxp is the point of breakdown for perturbative calculations. Taking a value of 
Aqcd = 1 GeV for pt = 3 TeV we can deduce that we should not use perturbative results 
below 6M ~ 12 GeV. 

Although we have emphasised that our estimate of the ISR corrections to the signal 
efficiency are incomplete, even to leading logarithmic accuracy, it is nevertheless of interest to 
compare to MC event generators. This is at least in part because MC generators themselves 
do not attain full single logarithmic accuracy and certainly exclude superleading logarithms. 
They do however contain a number of effects that would be formally subleading from the 
viewpoint of our calculation but could be of non-negligible significance numerically. Hence 
while we do not intend to make a detailed quantitative comparison we do expect to find 
qualitative similarities with MC results. 

To make this comparison, we generate pp —> ZH events at 14 TeV using Herwig++ 2.7.0 
with the UE-EE-5-MRST tune [46] and constrain the Higgs and Z boson to decay hadronically 
and leptonically respectively. 

Each generated event is directly handed over to the Rivet package [47], which implements 
our analyses. We tag the signal jet as the highest px Cambridge/Aachen [48, 49] jet with 
R = 1 as implemented in FastJet package [50] and plot the fraction of jets which lie in mass in 
the window Mp ± 5M as a function of a generator level cut on jet transverse momentum for 
three separate values of 5M. To make a comparison with our ISR results we omit FSR and 
non-perturbative corrections including hadronisation and underlying event (UE) corrections, 
switching them off for the MC results. The resulting comparison is shown in Fig. 1, where 
results are displayed for the ratio of the signal efficiency to the lowest order result. We observe 
that the signal efficiency, from MC, decreases with transverse momentum for all values of 5M 
as indicated by our exponentiated result Eq. (2.12). By making the mass window wider, i.e. 
choosing a larger 5M, one of course obtains a smaller Sudakov suppression and hence a larger 
efficiency but starts to lose the association with a well defined signal peak. 

2.2 Final state radiation 

For the case of plain jet-mass we would expect that the correction due to final state radiation 
can be neglected in our region of interest where px 3> Mp or equivalently A < 1. Physically 
FSR is associated to the bb dipole originating from Higgs decay. It is captured within the fat 
jet as long as the FSR gluons are not radiated at angles beyond those corresponding to the jet 
radius R. Due to angular ordering however, we would expect that most of the FSR radiation 
from the b quarks is emitted at angles smaller than I n this limit the final state emission is 
always recombined inside the fat jet. To be more precise, large-angle radiation beyond the jet- 
radius R is cut-off by the ratio of the dipole size (bb opening angle, given by A/(z(l — z )), to 
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HerwigH—|- signal efficiency: Plain 


Analytic signal efficiency: Plain 




Pt [GeV] 


Pt [GeV] 


Figure 1. Comparison of MC (left) and analytic (right) Eq. (2.12) tagging efficiencies for a range 
of mass windows as a function of a generator level cut on minimum jet transverse momentum px- 
This result has been generated using Herwig++ 2.7.0 for pp —» ZH at 14 TeV with the Z decaying 
leptonically and H —► bb, setting Mh = 125 GeV. We have tagged the signal jet as the highest px 
Cambridge/Aachen jet with R = 1. In this figure we have generated events at parton level with ISR 
only and divided out the contribution due to the lowest order result in both panels for clarity. 


the jet radius squared. Upon integration over z such corrections translate into terms varying 
at most as A In A, which we shall neglect as they vanish with A. We have verified this with 
MC and find that for sufficiently large transverse momenta, the correction due to final state 
radiation is of negligible magnitude O (0.5%) when compared to ISR O (20%) for R = 1 and 
Pt > ^00 GeV. We shall need to consider FSR more carefully when it comes to analysing the 
taggers in future sections. 

2.3 Non-perturbative contributions 

In order to get a complete picture of the physical effects that dictate the signal efficiency we 
also need to study how the signal efficiency changes after including non-perturbative effects 
such as hadronisation and underlying event (UE). In order to estimate those effects we used 
Herwig+-1- 2.7.0 with improved modeling of underlying event [51] and the most recent UE- 
EE-5-MRST tune [46] which is able to describe the double-parton scattering cross section [52] 
and underlying event data from yfs = 300 GeV to yfs = 7 TeV. It can readily be anticipated 
that the underlying event effect in particular will significantly degrade the mass peak and 
hence lead to a loss of signal. 

For this study, we consider all final state hadrons to be stable, therefore we switch off the 
decay handler module in Herwig+-h In doing so, we eliminate the chance of b flavour hadrons 
decaying into invisible particles such as neutrinos. If one were to include hadronic decay via 
invisible particles, one notices a universal reduction in signal efficiency for each tagger due 
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HerwigH—|- signal efficiency: Plain 


Figure 2. An MC study of the impact of hadro- 
nisation and underlying event on the signal effi¬ 
ciency for a plain jet mass cut as a function of the 
minimum jet transverse momentum. One can see 
the sizeable impact of both hadronisation and es¬ 
pecially underlying event on the signal efficiency 
in the window SM = 16 GeV. Details of the gen¬ 
eration are as in Fig. 1 but now we also include 
FSR at parton level. 
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to a loss of signal mass resolution. This is particularly important for the jets formed from 
the decay H —)> 66 as compared to W/Z jets because these electroweak bosons instead couple 
strongly to light quarks. For further information on experimental techniques to mitigate the 
impact of these particular sources of missing transverse energy, see for example [53]. We also 
assume a 6-tagging efficiency of 100% which is sufficient for a relative comparison of tagger 
performance and behaviour. The reader is referred to Ref. [3] for a discussion on the impact 
of 6-tagging efficiency on signal significance. 

In Fig. 2 we see how non-perturbative effects such as hadronisation and underlying event 
affect the signal efficiency when using plain jets tagged with SM = 16 GeV. One immediately 
notices that whilst hadronisation has a more moderate effect on the signal efficiency, which 
however increases with p^ (more precisely like see Ref. [54]), the dominant contribution 
comes from underlying event contamination which reduces the efficiency at pt = 3 TeV from 
about 60 percent to around 20 percent. This implies simply that one needs to consider removal 
of the UE for efficient tagging, which we shall discuss when we come to the boosted object 
taggers. We have also presented results here for R = 1 and the averaged UE contribution 
to the squared jet mass varies as R 4 [54]. Thus working with smaller R jets one may expect 
this contribution to be less significant. One should of course consider also the presence of 
considerable pile-up contamination, which we do not treat in this paper (see [55-57] for 
discussion of pileup subtraction techniques), but to which the plain jet mass will also be very 
susceptible. 

For now it is evident (as is well known) that the plain jet, with a mass window cut, is 
not a useful option from the viewpoint of tagging signal jets due principally to effects such as 
ISR, UE and pile-up contamination. It however provides a reference point for the discussions 
to follow. 
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3 Trimming 


Trimming [21] takes all the particles in a jet defined with radius R and reclusters them into 
subjets using a new jet definition with radius -Rtrim < R- It retains only the subjets which 
carry a minimum fraction f cu t of the original jet transverse momentum j// ub -i ct ) > / cut x p 
and discards the others. The final subjets are merged to form the trimmed jet. 

It is standard to use the Cambridge-Aachen (C/A) jet algorithm [48, 49] for substructure 
studies with trimming (and other taggers) and this is what we shall employ here. 

3.1 Lowest order result 

Compared to the plain jet mass, trimming already has a more interesting structure even 
without considering any additional radiation. If the opening angle between the bb pair is 
less then -Rtrim then trimming is inactive. However, if the angle is greater than Rtrim, one 
removes the softer particle if its energy fraction is below / cut . The result for signal efficiency 
is therefore given by an integral over z which can be expressed as, 

4 0) = / dz (l -©(/cut -min [z,l-z]) 0 - z(l-z))^- (3.1) 

Strictly we should also have written above the condition for the hard prongs to be inside the 
fat jet as we did for the plain jet case. However since this condition only results in terms 
varying as A /R 2 we shall neglect it here, consistent with our approximation A /R 2 <C 1. 

The subtracted term in the above equation represents the removal of any prong that has 
energy fraction below / cu t, in the region where trimming is active. 

Evaluating the integral in Eq. (3.1) gives the result 
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For now we shall consider values of f cu t that are standard in trimming analyses and 
therefore are considerably smaller than 0.5. For such choices of / cu t the second term in 
Eq. (3.2), which requires / cut > 1/2, clearly does not contribute. While the result in Eq. (3.2) 
is general, let us for illustrative purposes consider values of -Rtrim not too small, such that 
A/^rim « I- Then Eq. (3.2) implies a transition point at A ~ /cut A t 2 rim , which translates to 
a transition point at pr = Mh / (Vfc ut-Rtrim) • We remind the reader that the mass distribution 
for background jets also had transition points at Mj/p^ = f cn tR 2 and Mj/p^ = /cut^trim 
[31, 32]. The latter transition point is coincident with that reported above for the signal and 
corresponds to the minimal jet mass that can be obtained with trimming for a splitting with 
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opening angle -Rtrim- As one increases p^ beyond /( f cut E'f r - rn ) the background distribution 
starts to grow due to the onset of a double logarithmic behaviour so the mistag rate increases. 

Below this value of px the signal efficiency is given by 1 — 2/ cut and is therefore px 
independent while above it one obtains 1 — and hence acquires a px dependence. We 
remind the reader that these results apply specifically to the Higgs decay and for processes 
involving W/Z tagging different results will be obtained. This is due to the different splitting 
functions involved in hadronic W/Z decay. 


HerwigH—|- signal efficiency: Trimming 


Analytical signal efficiency: Trimming 




Figure 3. Comparison of MC (left) with the analytic result Eq. (3.2) (right) with = 0.3 for the 
tagging efficiency for two values of / cut as a function of generator level of jet transverse momentum. 
This result has been generated using Herwig++ 2.7.0 at parton level with no additional radiation for 
H —>■ bb jets. We note that the location of the transition points are reproduced by MC. 

In Fig. 3 we compare the signal efficiency using Herwig++ 2.7.0 for trimming applied to 
boosted Higgs jets with no ISR, FSR or non-perturbative effects to the analytical calculation 
above Eq. (3.2). We generate the tagging efficiency with two different / cut values and for 
Rtrim = 0.3 as a function of px for both MC (left) and analytics (right). One observes, as we 
would expect, that the MC clearly reproduces the analytic behaviour of the tagger at lowest 
order and, for our choice of parameters, the expected transition points at around 1320 GeV 
and 1860 GeV for / cut = 0.1 and / cut = 0.05 respectively. 

3.2 Initial state radiation 

Let us consider the action of trimming on ISR and compare to the case of the plain jet. For 
the plain jet we found a large logarithmic term that results in loss of signal with increasing px- 
On the other hand we would expect trimming to substantially remove ISR radiation and hence 
wish to check the impact on the logarithmically enhanced terms that emerge from considering 
soft ISR. The key difference with the plain jet case is that when the angle between the ISR 
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gluon and the jet axis exceeds Rtrim the soft gluon is retained only if it has kt/pr greater 
than /cut; where kt is the transverse momentum of the soft gluon. If the kt fraction is below 
/ cu t the ISR emission is removed by trimming, thus not contributing to the jet mass, and 
hence in this region there is a complete cancellation with virtual corrections. Alternatively, 
if the ISR falls into the trimming radius, we always retain the emission, much like the plain 
jet case. These constraints on real emission can be expressed as: 


©ISR,trim = 0 (6» 2 - R t 2 rim ) ^0 (x - /cut) © “ X ^j + 0 (/cut “ ®)^ (3.3) 


+ 0(i? 2 rim -0 2 )© 


2 M h SM 
Pt (# 2 + A) 


— x 


where we defined x as kt/pr and 0 2 = rj 2 + </ 2 is the angle between the ISR gluon and the fat 
jet axis. 

One can then repeat the calculation carried out for the plain jet mass in the previ¬ 
ous section using the above constraint. Taking the ISR emission probability in the eikonal 
approximation as before, and incorporating virtual corrections we get (in a fixed-coupling 
approximation) 
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We can evaluate the integrals straightforwardly and again shall discard terms that are 
power suppressed in A. The result obtained has two distinct regimes: For f cut > 2§4M„„e 
gets an answer of the form 
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while for f cut < 2 ^a M , we get 
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Eqs. (3.5) and (3.6) basically tell us that for sufficiently large / cu t i.e. above 
eliminates the logarithm we obtained for the plain jet mass replacing it by a less harmful 
In 1 //cut, provided one chooses / cu t not too small. On the other hand for smaller / cu t we see 
a transition to the logarithmic dependence seen for the plain mass. There is an additional 
correction term in equation Eq. (3.5) that represents the region of integration with 6 2 < R 2 rim - 
This term vanishes as Rtrim —> 0 and suggests that choosing smaller Rtrim values will result 
in less contamination from ISR as one may readily expect. We shall however see later, when 
studying FSR radiative corrections, that we cannot choose R 2 rim too small, i.e. <C A, due 
to degradation of the jet due to FSR loss. If one chooses R 2 rim <S 1 but of order A, then 
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within our small A approximation we can simply ignore this term. If on the other hand 
one chooses i?trim to not be too small then at very high px one should also consider the 
presence of this term, which appears only for / cut > ■ For most practical purposes, 

with commonly used parameter values, this term can safely be ignored. For example with 
/cut = 0.1, -Rtrim = 0.3 and SM = 16 GeV, even at pt = 3 TeV it only contributes order 10 
percent corrections relative to the main lnl// cuti piece. 

In principle, we should also resum the logarithms of / cut that are obtained with trimming. 
Such a resununation is however also beset by non-global and clustering logarithms and there¬ 
fore highly involved. Moreover the In 1 // cu t terms also play only a modest role numerically, 
for typical choices of / cu t ~ 0.1 and thus their resummation is not particularly motivated on 
phenomenological grounds. We note that In 1 // cut enhanced terms are also produced in cor¬ 
responding calculations for QCD background [31] and were not resummed in that case either. 
Consequently, unlike the plain jet case, we do not exponentiate the radiative corrections to 
the signal efficiency for trimming, or any of the other taggers studied in this paper. 

Let us then compare the main features of our simple analytical NLO approximation, 
augmented to include running coupling effects, to what is seen in MC event generators. In 
Fig. 4 we again compare our analytical approximations, with running coupling effects as in 
the plain mass case Eq. (2.13), to Herwig++ 2.7.0. For the MC studies we turn on ISR effects 
with boosted H —>• bb jets for a range of f cu t values, as a function of jet px , keeping SM fixed 
at 16 GeV. 

Plotting the ratio of the ISR corrected signal efficiency to the lowest order result, we 
can see that the approximate NLO analytic result reproduces the MC trends reasonably well. 
For values of / cut = 0.1 and 0.05 we do not obtain a transition over the range of px values 
shown (transition points are expected at roughly 200 GeV and 280 GeV, which are beyond 
the range shown) and none is seen in the MC plots. The behaviour over the entire plotted 
px range is quite flat with px since it depends mainly on lnl// cut , with running coupling 
and uncalculated subleading effects (in the case of the MC results) providing the mild px 
dependence that is seen. Instead for / cut = 0.005 we would anticipate a plain mass like 
degradation of the signal efficiency until the transition at about 890 GeV and then for higher 
px a flatter behaviour with px, consistent with MC results. This relative flatness over a large 
range of px is of course in contrast to the pure mass cut case. 

3.3 Final state radiation 

Let us consider the response of trimming to final state radiation. In principle there are a 
number of parameters to be considered, in particular / cut , A and Rtrim as well as the mass 
window 5M and transverse momentum px- Final state radiation, when not recombined into 
the fat jet, results in a shift in mass which can cause the resulting jet to fall outside the 
mass window 5M . Imposing a veto on soft FSR that degrades the jet mass results in the 
appearance of large logarithms whose structure we examine here. Additionally a relatively 
hard FSR gluon can also result in one of the primary b quarks falling below the asymmetry 
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HerwigH—|- signal efficiency: Trimming Analytical signal efficiency: Trimming 



Figure 4. Comparison of MC (left) and analytic Eq. (3.4) (right) trimming (i?trim = 0.3) tagging 
efficiencies for a range of / cut values as a function of a generator level cut on jet transverse momentum. 
This result has been generated using Herwig++ 2.7.0 [44] at parton level with ISR only for H —► bb 
jets, setting Mh = 125 GeV and SM = 16 GeV. The transition points correspond to the change from 
plain jet mass like behaviour to a ln/ cut term, discussed in the main text. 

cuts that are used in taggers, and hence loss of the signal. Such hard configurations can still 
come with collinear enhancements and so their role should also be considered. 

In order for an FSR gluon to be removed by trimming it has to be emitted at an angle 
larger than i?trim wrt both the hard primary partons. In addition its energy, expressed as a 
fraction of the fat jet energy, must fall below the / cu t cut-off. Lastly for the resulting jet to 
be retained, the consequent loss in mass must be less than SM. 

One can therefore write the following result for real emission contributions, valid in the 
soft limit where the gluon energy cu <C px'- 
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(3.7) 

where dQ is the solid angle element for the emitted gluon and the spatial distribution of 
radiation has been expressed in terms of the standard antenna pattern with the notation 
(ij) = 1 — cos Qij. The condition simply represents that the angular integration should 

be carried out over the region where trimming is active i.e. when the emitted gluon makes 
an angle larger than i?trim with both b and b. Moreover there is an additional step function 
that constrains the energy fraction to be below / cut and the factor 0 (SM — (Mh — Mj )) 
represents the constraint on real emissions due to the mass window SM. 

There are three distinct regimes one can consider according to the value of Rt r \ m ■ Firstly 


- 16 - 













when one has -Rtrim <C 9 bb then one can expect a collinear enhancement with a logarithm in 
i? tr im, that accompanies a soft logarithm arising out of the 5M constraint. This should be the 
most singular contribution one obtains for trimming so we analyse it in more detail below. In 
the region where -Rtrim ~ 9 bb on the other hand there will be no collinear enhancement and 
one obtains a pure soft single logarithm. In this region trimming is similar to pruning and 
the rnMDT as far as FSR is concerned, and we shall comment on the results in somewhat 
more detail in the next section. Finally in the region where -Rtrim d bb , the FSR correc¬ 
tion for trimming becomes more like the plain jet where large angle corrections are strongly 
suppressed. 



Figure 5. A configuration which contributes to the FSR correction to trimming signal efficiency in 
the region Rtrim 9 bb . Here gluon k is emitted collinear to the b quark with momentum fraction x 
outside a cone with radius i?trim- 

For the soft and collinear enhanced region R t rim 9 bb , let us perform a more detailed 
calculation. First let us examine the loss in mass in more detail. One has: 

Mfj — Mj = 2 (pi, ■ k + p b ■ k) = oj (Eb9 bk + E b 9 | fc ) . (3.8) 

Consider first that the gluon k is emitted collinear to the b quark with momentum fraction 
x, as shown in Fig. 5. In this limit one can neglect the Eb9 bk contribution above and set 
d\ k ~ 9 2 b = Z (\L^ ■ Requiring Mu — Mj < 6M and neglecting terms of order 5M 2 relative 
to MhSM gives the mass window constraint 2 Mh 5M > upT^/z. Next, defining x as the 
energy fraction of the soft gluon wrt the energy of the hard emitting prong i.e. x = uj/(zpt), 
one can write the mass window constraint as the condition x < 25 M/Mjj , where we used 
the fact that A = M^/p?- Likewise the / cut cut expressed as a condition on x just gives 

fcut/z > X. 

In the collinear limit, one can also simplify the angular integration in Eq. (3.7) which 
then assumes a simple d6 2 /9 2 form, resulting in a logarithmic contribution, In £ b . It is also 
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possible to perform the angular integration exactly i.e. beyond the collinear limit, to account 
for less singular soft large-angle contributions. More details of the derivation and results on 
the angular integration are provided in appendix A. 

We can therefore express the soft-collinear contribution to the FSR corrections as: 
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where a factor of 2 has been inserted to account for an identical result from the region where 
k is collinear to b rather than b and virtual corrections have been introduced corresponding 
to the —1 term in square brackets. 

Now let us write 9 2 ^ = A/(z(l — z)) and carry out the integration over x which gives 


eSFSR = ~2Cf— [ dz f In —^-ln(z(l - z))] In — 0 (/ cut - ze) , (3.10) 

n Jfcnt V "trim / ~ e 

where we introduced e = . The structure of the above result is, in essence, a double 

logarithmic form with a soft divergence in the limit 5M —> 0 and an accompanying collinear 
divergence when Rtrim — > 0. Let us take R 2 r \ m <C A, 3 for simplicity ignore the accompanying 
ln( 2 :(l — 2 )) term, and integrate over 2 to generate the following result: 


(/cm, e) 0 (/cut - + ° 2 (/cut > e) 0 (rb _ /cut ) ’ 

(3.11) 

where 

Cl = (1 - 2/^) + (1 - 2/ cu t) In ^ In / cut - (1 - / cut ) ln(l - / cut ), (3.12) 

x, /Cut r n 1 1 

u-2 — /cut /cut ni — • 

e e 

We note that for values of e <C / cu t the signal efficiency will be dominated by a In 
term in the coefficient Ci. The presence of the / cut constraint however means that in practice 
such logarithms make only modest or negligible contributions for a wide range of values of e or 
equivalently 6M. This can be contrasted to the case of filtering, computed in Ref. [34] , which 
has an identical collinear divergence to that for trimming above, but where additionally the 
absence of an / cu t condition leads to a much stronger In ’ enhancement, which needs to be 
treated with resummation. It is also straightforward to include the effects of hard collinear 
radiation by considering the full p gq splitting function rather than just its divergent 1/x piece. 
In this region it is possible for the quark to fall below the / cu t threshold and therefore to be 
removed. Such corrections do not come with soft enhancements and produce terms that 
vanish with either / cut or e, hence do not have a sizeable numerical effect that would require 

3 In this limit, the leading order result Eq. (3.2) is simply 1 — 2/ cu t, i.e. the two subjets never form a single 
subjet and are always subject to the asymmetry condition x > f cut . 
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resummation. For this reason, we do not calculate these terms explicitly, continuing to work 
in the soft and collinear limit. 

To make the above statements more explicit let us consider the situation at px = 300 GeV 
and choose / cu t = 0.1. The zeroth order result for signal efficiency is then £g = 1 — 2/ cut = 
0.8. If one chooses a value of Rtrim = 0.1 then ln(A/R;( rim ) ~ In 17 and one may expect 
significant (collinear enhanced) radiative corrections. Choosing a larger R tr i m = 0.3 one can 
instead reduce In A/R/ rim to ~ In 2, which is not enhanced and does not require resummation, 
implying a much more modest FSR contribution. However we should also examine the effect 
of this increased R t ri m on ISR and UE contributions. 

For our choice of parameters it is evident from MC studies that we do not pay a significant 
price for the increased i?trim value in terms of the ISR contribution. At the same value of px 
the UE contribution for Rtrim = 0.3 is also small (see Fig. 6). If one moves to higher px , say 3 
TeV, one should correspondingly lower R tr i m . Here one has A = 0.0017 and choosing a value 
of Rtrim ~ 0.1 would ensure a small FSR contribution as well as reduce the impact of ISR and 
the underlying event. This illustrates that by an appropriate choice of Rtrim one can negate 
large radiative losses due to FSR, without necessarily suffering from large ISR/UE effects. In 
general the optimal value of Rtrim will involve a trade-off between FSR radiative corrections 
and ISR/UE effects. We shall return to this point in section 7. 

We should also examine the role of soft divergences that are formally important in the 
e — > 0 limit. Taking a value of 5M = 2 GeV leads to e = 0.032. For our choice of / cu t = 0.1, 
we have ln/ cut /e rs_/ In 3, which is not a genuinely large logarithm. The overall coefficient 
of — 2Cf^x In A/Rt rim , which is given by the C\ and C *2 terms in Eq. (3.12), is for 5M = 2 
GeV, approximately 1.58 and for SM = 10 GeV approximately 0.34, thus indicating that 
resummation of soft logarithms is not a necessity. Expressed as a percentage of the tree level 
result 1 — 2/ cu t, the FSR corrections, as computed above, constitute a roughly two percent 
to ten percent effect for 5M ranging from 2 GeV to 10 GeV, if one chooses R[( rim — A/2. 4 

Hence, we find that even the leading soft-collinear enhanced contribution makes only 
modest contributions to the signal efficiency, at best comparable to pure order a s corrections. 
The main implication of this finding is that full fixed-order calculations or combinations of 
fixed-order results with parton showers (see Refs. [58, 59] for a review of the latter methods), 
would give a better description of the signal efficiency than pure soft showers. We further 
explore in appendix B, in somewhat more detail, the role of fixed-order calculations in a 
description of the signal efficiency. 

Eq. (3.11) is intended to address the formal limit R/ rim <C A. It indicates that choosing 
such small values of Rtrim is problematic due to degradation of the jet from FSR loss. In the 
opposite limit i.e. R/ rim 3> A the FSR dependence will be similar to the plain jet mass i.e. 
one may expect FSR losses to be negligible. On the other hand Eq. (3.5) for ISR corrections 
warns us that large choices of Rtrim may not be optimal due to increased ISR (and UE) 

4 As we shall see later, this constitutes a somewhat non-optimal choice for Rtrim and is made here for purely 
illustrative purposes, in order to estimate the size of soft but non-collinear enhanced effects. 
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contamination. One is thus led to think about the region ~ A. This is reminiscent 

of the choice made in pruning for Rp rune ~ Mj/pj,. In this limit the behaviour of FSR 
corrections for trimming is therefore expected to be similar to that for pruning for which 
a detailed calculation is carried out in section 4.1. We simply note here that in the region 
-^trim ~ A FSR corrections are relatively modest and can be thought of as pure order a s 
corrections, rather than carrying significant logarithmic enhancements. 


3.4 Non-perturbative contributions 

Let us now study the impact of non-perturbative corrections to the signal efficiency using 
trimming on boosted Higgs jets. 


IlerwigH —\- signal efficiency: HcrwigH—|- signal efficiency: 




Figure 6. An MC study of the impact of hadronisation and underlying event (UE) on the signal 
efficiency using the the trimmed jet (/ cu t = 0.1) as a function of the minimum jet transverse momentum 
for two different values of f?trim- One can see the impact of hadronisation and underlying event on 
the signal efficiency in the window SM = 16 GeV. Details of generation given in Fig. 2. 

In Fig. 6 we show the signal efficiency for a boosted Higgs signal jet after application 
of trimming with parameters Rtrim = 0.3 (left), Rtrim = 0.1 (right) with / cut = 0.1, as a 
function of the jet transverse momentum. One can see that hadronisation has little effect 
on the tagging rate of signal jets, due to the action of trimming on contributions which are 
soft and wide angle in the jet. UE has a larger impact on the signal efficiency due to soft 
contamination which is not checked for energy asymmetry. In other words inside the trimming 
radius the algorithm is inactive, and we automatically include all contamination coming from 
UE, which inside this region would contribute on average to a change in the jet mass squared 
varying as RjL m - The UE contribution could thus be substantially reduced by choosing a 
smaller This is in particular required at higher px as evident from Fig. 6. Also, in 


- 20 










contrast to the plain jet result in Fig. 2, one notes a significant reduction in sensitivity to 
non-perturbative effects when tagging signal jets using trimming. 


4 Pruning and mMDT 

In this section we shall study pruning [19, 20] and the modified mass drop tagger [3, 31]. We 
describe these methods together because unlike for the case of the QCD background studied 
in detail in Ref. [31] where the taggers can exhibit substantial differences, for the signal one 
finds quite similar behaviour. 


4.1 Pruning 

Pruning uses the initial jet to calculate a pruning radius which is dependent on the mass 
of the jet and its transverse momentum i? prune = Rf ac t x where Rf ac t is a parameter 
of the tagger. It proceeds by reclustering the jet, at each step checking if both the angle 
between the two objects % and j is greater than the pruning radius AR t j > R prune and the 
splitting is pr asymmetric i.e. min (pt, , Pt, ) < £ C ut x PT {1+f) ■ If these conditions are both true, 
pruning discards the softer of i and j, else i. j are combined as usual. This is repeated for 
each clustering step until all particles are either discarded or combined into the final pruned 
jet. For this study we use the default value Rf ac t = \ [19] and again use the C/A algorithm 
to both find and recluster the jets. 

At zeroth order the two signal prongs are always at an angle larger than R pmne and so 
the result is simply 1 — 2z cu t- For initial state radiation one can consider pruning to be similar 
to trimming with Rtrim replaced by R pmne . The pruning radius is given by 
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where 6 is the angle between the soft gluon and the Higgs direction (or equivalently, with 
neglect of recoil against soft ISR, the fat jet axis). 

One then ends up comparing the gluon angle 9 2 to x9 2 + A and thus for sufficiently soft 
emissions i.e. in the limit x —> 0, responsible for logarithmic corrections, one can just replace 
Rp mne by A. The situation is therefore identical to trimming but with R/ rim replaced by A. 
Since we work in the limit A < 1 we can neglect corrections varying as powers of A, that 
replace the R t rim dependence in Eq. (3.5). The result should then be identical to trimming 
in that one should obtain a In —, dependence with a transition to the plain mass behaviour 
visible for smaller £ cut values as in Fig. 4. We have verified that this is indeed the case with 
MC and that the efficiencies for pruning and trimming look essentially identical in terms of 
the response to ISR. An MC plot comparing ISR for all taggers is shown in the next section 
(Fig. 10), after we discuss the cases of rnMDT, Y-pruning and Y-splitter. 

Next we discuss briefly the situation with regard to FSR, in the context of pruning. Again 
one can employ the insight we gained in the previous section for the case of trimming. For 
the case of pruning there is no collinear enhancement since radiation that is lost is emitted at 
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an angle (wrt both hard prongs) larger than i?p rune ~ A = z( 1 — z)6k i.e. essentially of order 
9 2 ^. Thus the angular integration produces a finite O (1) coefficient and we will thus obtain 
only a single logarithmic enhancement, that results from the loss of soft radiation at relative 
large angles i.e. those comparable to the bb dipole size. The corresponding loss in mass can 
be expressed as 


Mjj - Mf = (pi +p 2 + k) 2 - (pi + p 2 ) 2 = 2k ■ (pi + p 2 ) ~ up T ( 0 2 + A) , (4.2) 


where 6 is the angle between the gluon and the jet direction. Noting that 0 2 is at most of 
order A (contributions from the region where 9 2 A are negligible due to angular ordering 
of soft radiation) one can replace Mjj — M 2 by upx A, ignoring any multiplicative factors of 
order one, that lead to only subleading logarithmic terms. The condition on gluon energy 
due to the mass window constraint is then 2 > JiL_ One also requires a constraint on the 

gluon energy such that it fails the 2; cu t condition ’. In the soft limit and to leading logarithmic 
accuracy this condition is just that ^ < z cut - Denoting ^ by x, and accounting for virtual 
corrections as for the case of trimming, we have the expression for FSR corrections to pruning: 


J 1 ) 

e S,FSR 




-C F 


Qts 

7T 


^cut 


dz 


dx 

x 


0 (Zcut 


x) 0 (x — e) 


(bb) d£l FSR 
(bk)(bk) 2tt prune ’ 


(4.3) 


where the condition 0pnine is simply the condition that the gluon is emitted outside an angle 
Rprune wrt both hard prongs. The angular integration and z integration is performed in 
appendix A and carrying out also the integrals over x we obtain the result: 


(1) ^ 27T Zcut 

e S,FSR — — ln ^cut > e- 


(4.4) 


The above results suggest that logarithmic enhancements for pruning are in principle 
present for e <C z cu t- However even with a choice of SM as low as 2 GeV, with a choice of 
£cut = 0.1 we obtain a modest logarithm ~ In 3, implying that soft enhanced effects can be 
neglected. To therefore assess FSR corrections in more detail, as we found for trimming, it is 
necessary to go beyond the soft approximation and study hard corrections, which we explore 
further in appendix B. However it should be apparent that radiative corrections due to FSR 
corrections represent essentially order a s corrections without significant log enhancements 
over a wide range of values of SM, px and z cut . We can exploit this stability against radiative 
corrections in optimising the tagger parameters, which we do in a subsequent section. 

To obtain a complete picture of pruning we also need to account for non-perturbative 
corrections arising from hadronisation and UE corrections. We shall comment on MC results 
for these aspects together with the mMDT results below. 

5 Strictly, with the precise definition of the z cu t condition we would have to consider a cut on u> normalised 
to the energy of the corresponding declustered prong i.e. zpr or (1 — z)pr but at single-log accuracy one can 
just always take a cut on lo/pt- 
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4.2 Modified mass drop tagger 

Here we shall consider the modified mass drop tagger along similar lines. We start by recalling 
the definition first of the regular mass drop tagger: The mass drop tagger (MDT) [3] is 
intended for use with jets clustered using the C/A algorithm. For each jet j one applies the 
algorithm: 

1. If jet j contains subjets, split j into two subjets j\ and j 2 by undoing the last stage of 
clustering such that mj 1 > m,j 2 . 

2. If there is a significant mass drop /i x rrij > rrij 1 and the splitting is not too asymmetric 
min {pt jxiPTj^j ^^‘ji ?2 / rn j > Vcut deem j to be a “tagged” jet and exit the loop. 

3. Else relabel j\ = j and repeat from step 1. 

The modified mass drop tagger corrected a flaw in the mass drop tagger so that in the 
event that the mass drop and asymmetry conditions are not satisfied one follows the more 
energetic (higher px branch) rather than the heavier branch j\ as advocated above. This is 
not only physically relevant (as one ensures that one identifies hard substructure rather than, 
for a small fraction of events, following soft massive jets) but also ameliorates significantly 
the perturbative structure for calculations related to QCD background jets, rendering for 
instance the QCD jet mass distribution purely single logarithmic and free from non-global 
logarithms. 

One other observation that was made in Ref. [31] concerned the role of the mass drop 
parameter p itself. There it was noted that the mass drop condition had a negligible impact 
on the result obtained for the jet mass distribution for QCD background jets and hence that 
it was possible to entirely ignore the mass drop requirement. For the current paper we shall 
consider this variant of the rnMDT, where we do not impose the mass-drop condition but just 
the asymmetry requirement via a y cu t cut-off. 

At zeroth order we obtain a signal efficiency e® = 1 — 2 y cnt coming from the asymmetry 
cut, which is the same result as for pruning. As far as the response to ISR is concerned, one 
can straightforwardly see that the general behaviour will be similar to the taggers we have 
considered before. Consider a fat jet consisting of a bb pair and an ISR gluon. If the gluon 
makes an angle less than 9 bb with either of the hard prongs of the fat jet then on declustering, 
we will break the jet into a massless prong and a prong with a small mass consisting of a quark 
and the soft gluon. In the soft limit the asymmetry condition will pass if the hard prongs are 
sufficiently energetic, i.e. exactly as at zeroth order, and the soft ISR will contaminate the 
jet. Such corrections will vanish with Mh/pt just like for the case of pruning and hence we 
can ignore them here, at high px- For relatively large angle ISR, the soft gluon emerges first 
on declustering the jet. If it fails the asymmetry condition it is removed and its effects cancel 
against virtual corrections. If it passes the asymmetry cut one obtains, in the small A limit, 
essentially a logarithm in y cu t as for pruning and trimming. One should note that, if one uses 
the asymmetry condition, with the y cu t measure exactly as defined above, the condition for 
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the gluon to pass the asymmetry can be expressed as x 2 9 2 / (A + x9 2 ) > y cut , where x = ui/px- 
Combining this with the condition 9 2 < R 2 , on the angular integration, we get the constraint: 


x > 


2/cut 

~Y 



\/ 1 + ^ Sp ) 


(4.5) 


which for y cu t A / R? reduces to the same constraint as for the case of pruning and trimming 
i.e. x > ycut- The main effect of this slightly different relationship between the gluon energy 
and y cut manifests itself, only for rather small y cut values and at low px, in terms of a change 
in the transition point to the plain jet mass like behaviour. The position of the transition 
can be computed as before by setting the RHS of Eq. (4.5) equal to 2M h$m reca ]}j n g that 
we take the fat jet radius R = 1. Let us consider / cut = 0.005 and 6M = 16 GeV, for which 
one obtains a transition point with trimming at px ~ 900 GeV, below which one sees a plain 
jet like behaviour. For the same value of y cu t the corresponding transition point for mMDT 
occurs at roughly px ~ 400 GeV, i.e. is absent over the range of px considered here. In any 
case very small values of y C ut would mean that large logarithms in y CVL t become important 
and hence should in general be avoided. For reasonable values of y cu t ~ 0.1, mMDT behaves 
essentially identical to pruning and trimming. We have verified all of the above points with 
MC studies and shall provide a plot comparing the response of taggers to ISR in the next 
section, after discussing Y-pruning and Y-splitter (see Fig. 10). Lastly we shall mention that 
for FSR corrections in the soft approximation, we do not observe any significant differences 
between mMDT and pruning. To understand this it is enough to realise that a soft FSR 
gluon emitted at an angle smaller than 9 h i wrt either the b or b direction is not examined for 
the asymmetry condition and hence does not contribute to a loss in jet mass, implying also 
the absence of any collinear enhancements. Emissions at an angle larger than Q 2 ^ = ^ 

contribute to a loss in mass and give a soft single logarithmic contribution identical to that 
for pruning. The result obtained is identical to Eq. (4.4) with the coefficient 27r/v / 3 ~ 3.63 
replaced by a coefficient that we have determined numerically. For px = 3 TeV the coefficient 
we obtain is ~ 0.646. 6 The key point however is that no large logarithmic corrections arise 
due to soft FSR emissions, owing to the presence of the y cut cut-off. Once again it would 
therefore be of interest to study the role of genuinely hard radiative corrections beyond the 
eikonal approximation, a study we carry out in appendix B. 


4.3 Non-perturbative effects and MC results 

We have analysed the effects of ISR and FSR for both pruning and mMDT and concluded 
that the taggers have an essentially similar behaviour for the case of signal jets. Of course 
our studies have focussed thus far on the perturbative component and hence it is prudent to 
examine non-perturbative effects before reaching any firm conclusions. 

We show in Fig. 7 the MC results for pruning and mMDT. One observes that the signal 
efficiency has only a weak dependence on px and that relative to the lowest order expectation 

6 This is the same result, J (1), found by Rubin in Ref. [34] for the coefficient of the FSR soft logarithm for 
filtering when Jt fl i te r = 0 b5 . 
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Figure 7. An MC study of the impact of hadronisation and underlying event (UE) on the signal effi¬ 
ciency for pruning (left) and mMDT (right) (z cut , y c ut = 0.1) as a function of jet transverse momentum 
with SM = 16 GeV. Details of generation are given in Fig. 2. 


of £g = 0.8, at high pp one sees a roughly 10 percent difference for the full parton level result 
with radiative corrections. One also sees a remarkable similarity between the two taggers 
over the entire pp range as far as parton level results and those including hadronisation 
are concerned. The UE contamination is however more clearly visible in the mMDT case 
towards lower pp values which owes to the larger effective radius 6 h z = -as compared 

Pt^z(I-z) 

to -Rprune ~ Mh/pt for pruning as well as differences in the definitions of the asymmetry 
parameters y cut vs z cut . 7 

At lower pp therefore it has been standard practice to use the mass drop tagger in 
conjunction with filtering as suggested in the original reference Ref. [3]. One should also 
bear in mind the results of Ref. [31] where for QCD background jets much more pronounced 
non-perturbative effects were observed for pruning than for mMDT, and in the final analysis 
one expects the impact on the background to dictate the ultimate performance of the taggers, 
rather than the comparatively small corrections one sees here for the signal, over most of the 
pp range studied. 

A final point to make about Fig. 7 is about the contrast between the FSR corrections 
observed for mMDT and pruning to those seen in Fig. 6 for trimming. To make the comparison 
we note the fact that for Fig. 6 we have chosen / cut = 0.1 and consider R t rim = 0.1. Then the 
zeroth order result for trimming is simply 1 — 2/ cu t as for mMDT and pruning, within the pp 
range we are studying. It is evident from Figs. 6 and 7 that while the FSR results for mMDT 

' It is of course possible to use mMDT with a z cut constraint defined as for pruning instead of y cut , as was 
studied in Ref. [31]. This choice would further enhance the similarity we observe for signal jets and is the 
default in the current public implementation of mMDT in FastJet [50]. 
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and pruning show hardly any dependence on px over the range studied, the corresponding 
results for trimming show a more pronounced px dependence. This feature already emerges 
from our simplified fixed-coupling analytics where the FSR corrections for trimming depend 
on px via a dependence on In A/ R. 2 rim while for pruning and rnMDT we have shown that the 
FSR corrections are px independent (see e.g. Eq. (4.4) for pruning). We shall return to this 
point in section 7. 

5 Y-pruning and Y-splitter 

In this section we shall study the Y-pruning modification of pruning suggested recently [31], 
along with the older Y-splitter method [2], We shall study these two methods together 
because they have a remarkably similar action on QCD background jets, and are particularly 
effective in cutting out QCD background in the vicinity of signal peaks, for boosted Higgs 
and electroweak gauge bosons, which makes them potentially valuable tools. They however 
differ very significantly from each other and from other taggers in their response to initial 
state radiation (and even more significantly to UE), for reasons we highlight below, and which 
were also mentioned in the case of Y-pruning, in Ref. [31]. In the next subsection we shall use 
the insight we gain in the present section to suggest improvements to Y-splitter in particular. 

5.1 Y-pruning 

We begin by examining the case of Y-pruning. Let us recall that this is a modification of 
pruning where one requires that at least one clustering is explicitly checked for and passes 
the pruning criteria else one discards the jet. In this way one removes spurious configurations 
where all emissions that are left after pruning is applied, are within an angular distance R piU ne 
of one another and hence never get examined for an asymmetry condition, resulting in the 
tagging of structures with only a single hard prong. 

A known issue with Y-pruning for the case of signal jets, already discovered in Ref. [31], 
concerns its response to soft wide-angle emissions from ISR or UE. Here one can have a 
situation where a soft ISR emission contributes to setting the pruning radius but is itself 
removed by pruning. If the pruning radius set by the ISR emission is larger than 9 bb then 
one would discard the resulting jet as it does not satisfy Y-pruning, causing a loss of signal. 
In the same kinematic region virtual corrections would lead to a jet accepted by Y-pruning 
(assuming the hard prongs arising from Higgs decay satisfy the z cu t criterion), and hence 
contribute to the signal tagging efficiency, as we shall demonstrate below. 

We first consider that for an ISR emission with energy, or equivalently kt, fraction i< 1 
which makes an angle 9 with the fat jet axis (again we neglect recoil against soft ISR), we 
have that Rp rune = Mj/p^ « A + x9 2 . We wish to compute the virtual contribution in the 
region where the real ISR is removed i.e. in the kinematic region where 9 2 > Rp rune and 
where x < z cu t- Moreover we require that Rp rune > 9 ^ so that the hard prongs are inside the 
pruning radius. 
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Thus we have, for the contribution of uncancelled virtual gluons, the equation 

f1 Zcut 


Aes = — Cf 


at.* 


7 r 


' -^cut 


dz j ^ dO 2 0 ( 9 2 - A) 0 (V - ^/( z )^ Q(R 2 - d 2 )&(z cut - x ), 


(5.1) 

where we have denoted the extra contribution for Y-pruning, relative to pruning, by Aes, 
have defined the function f(z) = — 1 and where the overall minus sign indicates that 

we are considering the virtual contribution. The first step function in the above equation 
comes from the requirement that we are considering the region where the ISR emission lies 
at a larger angle than i? prune relative to the jet axis (with x <C 1 ) while the second step 
function expresses the constraint that the bb opening angle is less than -R pm ne, the pruning 
radius. Lastly we have conditions corresponding to the ISR radiation being in the fat jet 
R 2 > 9 2 and the energy condition x < z CVLt corresponding to removal of the corresponding 
real radiation. We can straightforwardly carry out the integrals over 6 2 , x and z. In particular 
the integration over x produces the logarithmically enhanced term we seek, where one obtains 
a logarithm in the ratio of z cut to A. One may expect that this logarithm becomes large and 
hence should have a visible effect at high px, for values of z cu t that are not too small. The 
final result, discarding all other terms that are less singular in the high px limit (e.g. those 
that vanish with A), is 


Aes 


—C f —R 2 In AHL^-0 (/J — 3 ) 
7r A 




"cut (1 



+ (1 — 22 cu t )0 



■Zcut) 



(5.2) 


where we defined (3 = Zcu ^ . 

At high pt therefore, Aes potentially dominates the normal logarithmic dependence of 
pruning on z CVL t- We can therefore distinguish Y-pruning from other taggers by looking at the 
transverse momentum dependence of the signal response to ISR in the high px limit. 

In Fig. 8 we compare the sum of the analytic result from pruning and the additional 
contribution described in Eq. (5.2) to MC with ISR for a range of ^ cu t values. One immediately 
notices, in both analytical and MC plots, that the px dependence of Y-pruning is significantly 
different from that of pruning for the commonly used value z cu t = 0.1, (see Fig. 10). In 
agreement with our expectations the signal efficiency as given by MC in Fig. 8 first increases 
with px as for the case of pruning and then decreases beyond a certain point which we expect to 
be the onset of the logarithmic behaviour we have computed above. Our calculation indicates 
that the onset of the logarithm in z cut /A is for j3 > 3 which for z cut =0.1 corresponds to 
approximately 680 GeV and for z cut = 0.05 to approximately 970 GeV. This is consistent 
with what is seen in MC though of course the transitions are not as sharp as in the analytical 
result. 

As far as final state radiation is concerned there is no significant difference between Y- 
pruning and pruning. The soft large-angle contributions that are responsible for the loss of 
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Figure 8. Comparison of MC (left) and analytic (right) Y-pruning signal tagging efficiencies for a 
range of z cut values as a function of a generator level cut on jet transverse momentum. This result 
has been generated using Herwig++ 2.7.0 [44] at parton level with ISR only for H —► bb jets, setting 
Mff = 125 GeV and SM = 16 GeV. We have divided out the contribution due to the born configuration 
in both panels for clarity. 
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Figure 9. An MC study of the impact of hadroni- 
sation and underlying event (UE) on the signal ef¬ 
ficiency using the the Y-pruned jet (z cu t = 0.1) as 
a function of the minimum jet transverse momen¬ 
tum. One can see negligible impact coming from 
hadronisation and but some degradation coming 
from underlying event contamination on the sig¬ 
nal efficiency in the window SM = 16 GeV. Details 
of generation given in Fig. 2. 

500 1000 1500 2000 2500 3000 

Pt [GeV] 


0.9 

0.8 


0.7 


0.6 


R) 


0.5 

■ 

0.4 

■ 

0.3 

FSR 


FSR+ISR - 

0.2 

Hadronisation — - — - 

n i 

Hadronisation+UE . 


signal we saw for ISR are strongly suppressed for the case of FSR, due to the colour singlet 
nature of the parent Higgs particle and angular ordering. We conclude by showing in Fig. 9 
an MC plot for Y-pruning at both parton level (including both ISR and FSR) and with 
non-perturbative corrections. As expected there is some significant loss of signal due to UE 
contributions for precisely the same reasons as for the case of ISR and as also observed in 
Ref. [31]. In spite of this deficiency it was also shown in Ref. [31] that due to its strong 
suppression of background jets Y-pruning produced a signal significance that was at least 
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comparable and at high px exceeded that from the other taggers studied (mMDT, pruning 
and trimming), especially for gluon jet backgrounds. In the next section we shall study an 
older method, Y-splitter, that has a similar action to Y-pruning for background jets, which 
makes its action on signal worth exploring further. 


5.2 Y-splitter 

The Y-splitter technique was first introduced in Ref. [2] in the context of W boson tagging. 
The main observation was that the kt distance measure (as employed in the kt algorithm [60] ) 
between the two partonic prongs of a IT decay tended to be close to the IT mass, which is a 
consequence of a typically symmetric energy sharing between the two prongs, in contrast to 
the case of QCD background where the energy sharing is typically asymmetric. To exploit 
this fact one takes a fat jet constructed with the kt algorithm and undoes the last step of the 
clustering. This produces two prongs and we let them have energy fractions z and 1 — z. The 
kt distance dij is given, at small opening angles, by the square of the transverse momentum 
of the softer prong wrt the direction of the harder prong which gives: 


dij = min(z, 1 — z) 2 pxO, 


ij- 


(5.3) 


One can either cut directly on this distance by requiring it to be of order M (y {Mjj hi 


our case) or cut on the ratio of dij to the jet invariant mass squared M 2 = p\z(l — z)9 2 _■ ( 


see 


e.g. Ref. [29, 61]). In the present case we shall choose the latter option and hence demand 
that 

min( 2 :, 1 — z) 


din 

Mf 


f n \ > 2/cUt- (5-4) 

max(z, 1 — z) 

If this condition is satisfied then one tags the jet else one discards it. Taking for instance 
z < 1 — z the cut amounts to requiring that z > y c ut/(l + 2/cut) = 2/cut — O (//cut)- Likewise 
for z > 1 — z one obtains z < 1 — 2/cut + O (//cut) • 

The Y-splitter method has not been as widely used in recent times as some of the other 
methods we have studied here, though one relatively recent application has been for the 
purposes of top tagging [15]. Also, a detailed comparison of Y-splitter with N-subjettiness 
was carried out in Ref. [62], 

Let us first consider the action of Y-splitter on QCD background. If one considers a 
quark jet with an additional soft-collinear gluon emission, then the y cu t condition is active on 
the gluon energy, which means it regulates the soft divergence associated to gluon emission. 
The usual double logarithmic structure of the QCD jet mass gives way to a single logarithmic 
answer precisely as for the mMDT, pruning and Y-pruning methods [31]: 


da (Y-s P htter,LO) ^ 

~C F — 

<7 dp 7T 


In 


//cut 


) P < 2/cut) 


(5.5) 


M? 


with p = 3 r2 and where we neglected terms varying as powers of y cu t. For p > y cut one 
obtains a transition to the normal jet mass result. 
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At all orders the result for Y-splitter can be derived using methods similar to those in 
Ref. [31]. Since the derivation of this result takes us away from our current focus on signals, 
we shall not provide it here, but shall do so in a forthcoming paper [63]. The basic fixed 
coupling result, for small p can be expressed in the form: 


p dc ( Y_s P litter ) a a f. 1 

- C F In- 

cr dp tt \ y cut 



C F&s ^2 1 
27t p 


(5.6) 


which represents a Sudakov suppression of the leading order result. The form of this result 
is identical to that derived for Y-pruning in the region p < z 2 ut and when a s In O-y In ^ <C 1 
(see Eq. (5.10b) of Ref. [31]), though subleading logarithmic terms will differ. One can verify 
this similarity of Y-splitter to Y-pruning, for the case of QCD jets, by examining the results 
produced by MC and we shall do so in the next subsection. 

Next we shall study the response of Y-splitter to signal jets, for our case of Higgs decays. 
At zeroth order the result is similar to that for mMDT and pruning and with neglect of order 
y 2 ut terms one simply gets £g°^ = 1 — 2y cut , which is as usual a consequence of the uniform z 
distribution and the asymmetry cuts on z. 

Beyond zeroth order however one should expect very significant differences between 
Y-splitter and the other taggers. These come essentially from the response to ISR (and 
UE and as one can expect also from pile-up). In order to understand the ISR response let us 
consider our usual configuration of a bb pair and a large-angle soft ISR gluon with 0 ~ 1 > 6 b j y 
There are two configurations of interest. Firstly when the distance djj between the ISR gluon 
and both the hard prongs is larger than that of the bb quark pair one examines k 2 /M 2 where 
kt is the transverse momentum of the gluon wrt the jet axis, and require that this be greater 
than j/cut) f° r Hie jet to be retained. Also we require that the mass window constraint is 
satisfied as for the plain jet mass. 

On the other hand when the dij between the gluon and the bb pair is smaller than that 
between the b and b prongs, the gluon is simply clustered into the jet and one just imposes 
the y cu t condition on the two hard prongs as at zeroth order. The gluon thus contaminates 
the jet and one has to impose the mass window constraint again as for the plain jet mass. 

In fact one can argue that the first configuration where the gluon has a larger kt distance, 
along with the fact that it should not be too energetic so as to comply with the mass window 
constraint, is limited to a small corner of phase space that vanishes with 5M/px , where SM, 
as before, is the size of the window. 

To see this most straightforwardly one notes that the gluon has a kt distance from the 
bb pair (or equivalently in our soft large-angle approximation from either the b or b) which is 
given essentially by dij = x 2 p^0 2 — x 2 p\, where x = uj/pt■ On the other hand we have the 
kt distance between the b and b is min (z, 1 — z) 2 p^0 2 r, and let us for convenience suppose 
that z < 1/2. One requires therefore that x 2 p j, > z 2 p^0 2 j ) , while the mass window condition 
again for 0 2 ~ 1 A gives x < 2Mh 2 sm . These conditions are only simultaneously satisfied 

if z/{ 1 — z) < (2 SM/pt) 2 , which corresponds to a negligibly small region of phase space and 
given the uniform distribution in z, can be ignored. Hence we are left with the situation 
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Figure 10. A MC study of the impact of 
ISR on the signal efficiency for various taggers 
(2/cut, -cut, /cut = 0.1, Atrim = 0.3) as a function 
of the minimum jet transverse momentum. One 
can note the similarity of mMDT, pruning and 
trimming while Y-splitter and Y-pruning are dif¬ 
ferent, with Y-splitter in particular virtually in¬ 
distinguishable from the plain jet mass. Details of 
generation given in Fig. 1. 


that, modulo small corrections, the ISR gluon contaminates the jet and gives a result that is 
essentially like the plain jet mass. This implies considerable degradation in mass due to ISR 
and further due to UE and, of course in the final analysis, pile-up. 

Let us compare Y-splitter with the other taggers and the plain jet mass using MC. In 
Fig. 10 we show the response of all taggers studied thus far to ISR, as a function of jet px- 
One can immediately see that Y-splitter and the plain jet mass are essentially identical. One 
also notes that mMDT, trimming and pruning have a very similar behaviour to one another 
as we expected from our analytical estimates while Y-pruning suffers at high px as already 
observed, while still remaining far better than Y-splitter. 

As far as FSR is concerned, one does not expect any significant issues with Y-splitter. 
In contrast to ISR a soft FSR gluon will nearly always be clustered with the hard emitting 
partons, as a consequence of its softness and angular ordering, and so end up as part of the 
fat jet, thus not contributing to a loss in mass. Its effects will cancel against soft virtual 
corrections leaving us to study genuinely hard non-collinear configurations which ought to 
have a relatively modest impact at the level of pure order a s corrections. 

As far as NP effects are concerned we have also carried out MC studies for Y-splitter 
with hadronisation and UE. The findings here are that the effects are comparable in size to 
the plain jet mass. 

Thus in the final analysis it appears that Y-splitter may not be as useful as the other meth¬ 
ods studied here and in particular Y-pruning, even though it shares a very similar suppression 
of the QCD background. While Y-splitter appears effective at identifying hard substructure 
and removing background, it is not effective in grooming away soft contamination, as is in¬ 
built to varying extents in the other methods we have studied. This suggests using Y-splitter 
along with another method more effective in grooming may alleviate some of the issues we 
see with the signal. Therefore in the next subsection we shall consider its combination with 
trimming, which we find has some noteworthy features and produces interesting results. 
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Figure 11. Signal efficiency for tagging hadronic 
H jets using Herwig++ 2.7.0 [44] with underlying 
event and hadronisation as a function of a gen¬ 
erator level cut pt on transverse jet momentum. 
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6 Y-splitter with trimming 

Here we shall study the combination of Y-splitter with trimming, in view of the lack of any 
effective grooming element in Y-splitter, as mentioned above. We do not have to necessarily 
choose trimming in this respect and it is possible to study a combination of Y-splitter with 
other methods such as rnMDT and the recently introduced soft drop method [33]. Indeed 
it has been known for some time that combinations of substructure tools can often produce 
better results than the individual tools themselves [64] and thus one may hope to improve 
the performance of Y-splitter using a suitable complementary tool. 

We first study the impact of applying trimming on signal jets that are tagged by Y- 
splitter. For the combination of Y-splitter with trimming we choose f cut = y c ut = 0.1 and 
Rtrim = 0.3. 

The MC analysis is shown in Fig. 11 which demonstrates that the use of trimming sub¬ 
stantially ameliorates the loss of signal we saw with Y-splitter. It is evident from the same 
figure that while Y-splitter with trimming still does not reach the signal efficiency of some 
other methods, the difference is much less pronounced than before. In fact one observes that 
the signal efficiency for Y-splitter with trimming bears a qualitative similarity to Y-pruning. 
The reason for this is that the use of trimming turns the plain mass like behaviour of Y- 
splitter into the behaviour for trimming, except for configurations that have been rejected by 
Y-splitter, on which subsequent trimming does not act. This corresponds to the Y-splitter re¬ 
jection region whereby an ISR gluon has the largest kt distance but fails the y cut requirement. 
This kinematic configuration is reminiscent of that which resulted in the extra Aeg term for 
Y-pruning. In the present case uncancelled virtual corrections integrated over the Y-splitter 
rejection region produce a term ~ — Cf^R 2 In %=, which corrects the simple trimming re¬ 
sult. However the overall efficiency remains considerably higher than the Y-splitter or plain 
mass result, due to elimination of the plain jet mass like logarithm. 

Next one should study also the impact of using trimming in conjunction with Y-splitter 
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Figure 12. MC results for the differential jet 
mass distribution using Y-splitter with trimming 
(j/cut = 0.1,-Rtrim = 0.3), Y-splitter (y C ut = 0.1) 
and Y-pruning (2 cut = 0.1). This result has been 
generated using Herwig++ 2.7.0 [44] at parton 
level. A minimum pp cut on generation of the 
hard process qq —► qq was made at 3 TeV for 14 
TeV pp collisions. We take the mass of the two 
hardest jets found using the C/A algorithm with 
R = 1.0 and apply the algorithms with a fixed 

Zc.nt/y cut • 


on the QCD background. In this article, given our focus on signal jets, we shall not attempt to 
provide a detailed analytical study of the background case, which together with the derivation 
of the basic Y-splitter formula Eq. (5.6) we shall carry out in a forthcoming article [63]. Here 
we shall confine ourselves to MC studies and the result of these is shown in Fig. 12. Once 
again we study the action of trimming on jets that have been tagged by Y-splitter where we 

choose / cu t = ycut = 0.1 for both Y-splitter and trimming and Rtrim = 0.3. In this plot, we 

M? 

remind the reader that p = p2 3 R 2 he. the normalised jet mass as defined and used in Ref. [31]. 

One notes from Fig. 12 that Y-pruning, Y-splitter and Y-splitter+trimming all have a 
fairly similar action on background jets and provide, for our choice of parameters, a significant 
suppression of background around the signal mass-peak ~ 100 GeV. These results are for 
quark backgrounds but similar results are obtained for gluon jets. It is noteworthy that the 
action of trimming for the chosen parameters, appears only to have an apparently subleading 
effect and hence the desirable property of Y-splitter, that of reducing background via a 
Sudakov suppression term (see Eq. (5.6)), is largely unaffected. Such findings are certainly 
worthy of analytical follow-up for general choices of parameters, which we shall provide in 
our forthcoming work. 

Given the improvement in signal efficiency that we have achieved with Y-splitter with 
trimming, and the fact that the backgrounds are comparably (and in fact apparently some¬ 
what more) suppressed compared to Y-pruning in the mass region of interest, it is worth 
examining the signal significances (i.e. the ratio es/y/^B of signal efficiency to the square 
root of background efficiency) that can be achieved with the various taggers, as a function of 
transverse momentum. These are shown in Fig. 13 for quark and gluon backgrounds. One 
observes that the Y-splitter with trimming method outperforms the taggers discussed here, 
particularly at high pp. Note that we have used Rtrim = 0.3 for Y-splitter with trimming. 
For trimming however this represents a non-optimal choice at high pp (see Fig. 18 later) and 
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Signal significance with quark bkgds 


Signal significance with gluon bkgds 




Figure 13. Signal significance for tagging hadronic H jets with quark (left panel) and gluon (right 
panel) backgrounds using Herwig++ 2.7.0 [44] with underlying event and hadronisation as a function of 
a generator level cut pp on transverse jet momentum. We compare the signal significance for different 
algorithms to Y-splitter+trimming and find that the latter outperforms the others at high pp. Here 
we have used i?t r im = 0.1 for pure trimming since, in contrast to Y-splitter+trimming, we expect this 
value to be closer to optimal than i?trim = 0.3 at high transverse momenta (see Fig. 18). 


hence we have chosen to present our results for trimming with i?trim = 0.1. A detailed study 
of optimal parameters for Y-splitter+trimming remains to be carried out and we shall aim 
to present the results of such a study in forthcoming work. 

The results shown in Fig. 13 are for our standard process, pp -+ ZH, but similar results 
are also obtained for W tagging as shown in Fig. 14. Here we observe that Y-splitter with 
trimming now consistently outperforms the other taggers discussed over a range of pp. This 
emerges from the different mass window of the W boson (64 — 96 GeV) compared to the 
Higgs (109 — 141 GeV). In the window M\y ± 16 GeV, the background mass distribution 
of Y-splitter+trimming is smaller relative to Y-pruning than the window around Mh (see 
Fig. 12). Hence, we observe a greater signal significance tagging W rather than Higgs relative 
to the other taggers for large pp. 8 

7 Optimal parameter values 

In this section we shall use analytical expressions to derive values of parameters that maximise 
the signal significance for the different taggers. We do not expect the values so derived 
to really be optimal in the sense that they will not take into account non-perturbative effects. 

8 We have performed preliminary studies for other possible combinations such as Y- 
splitter+mMDT/pruning/soft drop. These all have a similar qualitative effect on both the background 
and signal jet mass distribution as Y-splitter+trimming. Hence, one observes a comparable gain in signal 
significance over Y-splitter for all of these combinations. However, we find that Y-splitter+trimming has the 
best signal significance for tagging W bosons over background in the high pr limit. 
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Signal significance with quark bkgds. Signal significance with gluon bkgds. 




Figure 14. Signal significance for tagging hadronic W jets with quark (left panel) and gluon (right 
panel) backgrounds using Herwig++ 2.7.0 [44] with underlying event and hadronisation as a function 
of a generator level cut pp on transverse jet momentum. We deem a jet tagged if it has a final 
mass in the window 64 — 96 GeV. We compare the signal significance for different algorithms to Y- 
splitter+trimming and find that the latter outperforms the others at high pp. In this plot, we use all 
tagger parameters which match those used for W tagging in the paper [31] for ease of comparison. 

Indeed we should emphasise that optimal parameter values have already been extracted us¬ 
ing full MC studies for all methods considered in the original papers and also examined in 
subsequent studies such as in Ref. [26]. Analytical studies of optimal parameters have also 
been carried out by Rubin in Ref. [34] in the context of a filtering analysis, which we do not 
consider here. 

Nevertheless we can regard it as one of the tests of the robustness of these methods that 
the values derived here with analytical formulae as inputs should be reasonable approxima¬ 
tions to what one obtains in complete MC studies. This is because one wants ideally to have 
substructure methods where statements about performance are largely independent of our 
detailed knowledge about non-perturbative corrections. We are also interested in examining 
to what extent general trends that emerge with analytics, such as the dependence of optimal 
parameters on pp, are replicated in full MC studies. For the following studies we confine 
ourselves to quark backgrounds as we have no reason to believe that gluon backgrounds will 
differ significantly in terms of the conclusions we reach here. 

Having observed in this paper the relatively small radiative corrections, both for ISR 
and FSR, that emerge for signal processes over a broad range of parameter values, one feels 
encouraged in a first approximation to turn off these effects and treat the signal in a tree-level 
approximation, except for the case of trimming as we discuss below in more detail. In other 
words we anticipate that the signal significance ought to primarily be driven by the tree-level 
results for signal while for the background we shall use the resummed formulae first derived in 
[31]. For self-consistency, one should then also verify that for the optimal values one derives, 
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the radiative corrections to signal efficiency can indeed be considered small relative to the 
tree level result. 


7.1 mMDT 


Let us follow the above described procedure for the mMDT and extract the optimal value of 

2/cut ■ 

One needs to study the background mistag rate in the window Mh — 5M < Mj < 
Mh + SM which corresponds to a range in p, pn — 5p < p < pn + Sp, with 5p « 2 MnSM/p^ 
where we have used R = 1. 

We then have the following expression for the signal significance: 

_ 1 — 2r/cut 

V^b ^/E (2/cut, PH + Sp) - E (y cut , ph ~ Sp) ’ 

M 2 

with pu = —f~ and where E(y cu t, p) is the integrated mMDT background jet mass distribution 
Pt 

calculated in Ref. [31]. Thus the quantity E (// cu t, Ph + Sp) — E (y cu t, Ph — Sp) represents the 
integral of the background jet-mass distribution over the mass window corresponding to signal 
tagging with mMDT. Note that we have treated the signal efficiency at lowest order. 

We can find the value of y CVL t that maximises signal significance by taking the derivative 
of the RHS of Eq. (7.1) wrt y cut and setting it to zero which gives: 


-4 _ E ; (ycut, PH + Sp) - E' (g/ cut , PH - Sp) 

1 - 27/cut E (y cut , PH + sp) - S (//cut > Ph ~ Sp) ’ 


(7.2) 


where E 7 denotes a derivative wrt y cu t- Neglecting higher order corrections in 5p, the optimal 
value for y cu t satisfies 

d ( 


-4 d v° ut \dp) p=PH 


1 2//cut 



P=PH 


(7.3) 


Next we use the analytical expressions for E {y m t,p) derived in Ref. [31]. The fixed-coupling 
result for the mMDT for p < y C ut reads: 


£ (2/cut, p) = exp 


CfCKs 
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(7.4) 


We can use this result in Eq. (7.3), assuming that the optimal value lies in the region 
p < //cut, 9 and doing so gives us an implicit equation for optimal //cut = //max^ 


-4 y max .o OL a 1 ^/max 4 

-= G p — In----• 

1-2 y max 7 T p H 3 + 4 In //max 


(7.5) 


One can numerically solve the above equation, which contains the essential information 
about the optimal y cut and its dependence on px- The values we obtain for pr = 1,2,3 

9 This is reasonable at high pr , since pH (ITeVj fis 0.015 much smaller than typical y cu t values ~ 0.1 quoted 
in the literature. 
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TeV with a s = 0.1 are approximately 0.124, 0.102 and 0.088 respectively. Whilst we have 
not included running coupling effects in the above derivation, one finds it is straightforward 
to do so. Using the full calculation of Ref. [31] for the background, i.e. including running 
coupling effects and a transition to the plain mass like behaviour for p > y cut , we compute 
the analytical signal significance plotted in Fig. 15 as a function of y C ut- 

From Fig. 15 we note firstly that the peak position of the analytical signal significance 
is approximately in agreement with the numbers we quoted immediately above for the fixed- 
coupling calculation. A kink can be seen in the analytical result for px = 1 TeV, the origin 
of which is the transition from a single-logarithmic dependence on p valid at p < y cut , to 
the usual double logarithmic result for the plain mass for p > y cut . We have also shown, 
in the same figure, results from Herwig++ 2.7.0 at both parton level and at full hadron 
level including UE. We find the Herwig+-1 - 2.7.0 results at parton level in quite reasonable 
agreement with the simple analytical estimates we have made, for both the peak positions 
and the evolution of optimal y cut with px, though the values of the peak signal significance 
itself differ somewhat. It is noteworthy also that hadronisation and UE do not change the 
picture significantly at the px values we have studied here. One other feature that emerges 
from both analytical and MC studies is that the peaks themselves are fairly broad so that 
choosing a slightly non-optimal y cu t does not greatly impact the tagger performance. We have 
also provided in Fig. 15 a direct comparison between optimal values from Herwig+-1 - 2.7.0 
(including all effects) and analytical estimates. We show the results for the range of y cu t 
values (denoted by the pink shaded region) that correspond to a ±2% variation around the 
peak signal significance. For Herwig+-|- 2.7.0 instead we indicate the same range of y cu t values 
by the blue bars shown. We find a good degree of overlap within this tolerance band between 
full Herwig++ 2.7.0 results and analytical estimates. 

One can draw at least a couple of inferences from our observations above. Firstly, as we 
have argued, radiative corrections to the signal are clearly of minor significance to the tagger 
performance for mMDT. The fact that the analytics are generally in good agreement with 
Herwig+-1- 2.7.0 points to the importance of the background contribution in the context of 
the signal significance and the success of analytical approaches in describing this background 
[31]. The fact that non-perturbative effects play an evidently minor role at the values of px 
studied above is also reassuring from the point of view of a robust understanding of tagger 
performance. 

We end with a caveat. If one moves to still lower px values then one has to reconsider 
some of the arguments above. Here one would have a situation where say at 200-300 GeV 
we expect to be in the region where p > y C ut and so Eq. (7.5) does not directly apply. This 
apart, perhaps more significantly one can expect UE to start playing a larger role due to the 
larger effective radius ~ where UE particles accumulate without being removed by the 
asymmetry cut. Here one ought to consider the use of rnMDT with filtering and optimise the 
parameters of both methods together as in the original analysis [3] . 
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Figure 15. mMDT analytical signal significance from tree level signal and resummed background 
as a function of y cut (top left) compared to Herwig+H- 2.7.0 [44] at parton level (top right) and with 
hadronisation and MPI (bottom left). The signal process used is pp —> ZH where we require the 
Higgs and Z to decay hadronically and leptonically respectively with quark backgrounds. We place a 
generator level cut on the Higgs transverse momentum pt of 1, 2 and 3 TeV. Jets are tagged around 
the Higgs mass with a mass window 5M = 16 GeV. The bottom right panel shows the analytic 
optimal y cut values as a function of pt (red line) with a 2% variation in signal significance about the 
peak (red shaded area). We overlay the optimal results for y cut obtained using Herwig++ 2.7.0 with 
hadronisation and underlying event at 1, 2 and 3 TeV, with an equivalent 2% variation about the peak 
signal significance (blue bars) and at parton level (black bars). 
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Optimal parameters: 



Figure 16. Pruning analytical signal significance from tree level signal and resummed background 
as a function of z cut compared to HerwigH—f- 2.7.0 at parton level and with hadronisation and MPI. 
Details of generation given in Fig. 15. 


7.2 Pruning and Y-pruning 

Here we carry out a similar analysis for the case of pruning. The resununed expression 
for pruning, for QCD jets, is considerably more complicated than for mMDT. The result 
essentially has two components which in Ref. [31] were dubbed the Y and I components 
respectively. We have already dealt with Y-pruning in some detail in this article in the 
context of signal jets. For the background, as we have also discussed in a previous section, 
Y-pruning, for small jet masses, p < z^ ut , consists of a suppression of the leading order single- 
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logarithmic result by a Sudakov like form factor and gives rise to a desirable suppression of 
the background in the signal region, for high px values. The I pruning contribution, on the 
other hand, starts at order a 2 S and is as singular as the plain jet mass i.e. double logarithmic. 
For the sum of Y and I components, i.e. for pruning as a whole, one observes two transition 
points: for p > £ cu t the behaviour is like the plain mass (as for mMDT), for < p < z C ut 
there is a single log behaviour as in the leading-order result and as for mMDT, while for 
p < zl nt we see the I-pruning contribution starts to become more important which can cause 
growth of the background and the appearance of a second peak for quark jets and a shoulder 
like structure for gluon jets. 

We do not, for brevity, present here the resummed results for pruning for QCD jets, 
referring the reader instead to section 5.3 of Ref. [31]. Here we simply plot the analytical 
signal significance for pruning as for mMDT, with neglect of radiative corrections to the signal 
efficiency, but with the full resummed calculation for QCD background, which we take to be 
quark jets alone. The resulting signal significance is displayed in Fig. 16 along with MC 
results at parton and hadron level. One would expect the optimal z cut to lie in a region that 
corresponds to the mMDT like region i.e. such that pn > ut . Choosing a larger z cu t would 
push us into the region where the background starts to grow due to onset of I-pruning and 
hence the signal significance falls off. At 1 TeV where pu ~ 0.0017 one can expect an optimal 
value of z CVLt to be below yj~pu ~ 0.125 while for 3 TeV one may expect a value closer to 0.04 
and these expectations are roughly consistent with what one notes with both the analytical 
and MC results shown. Once again we observe that non-perturbative effects do not change 
the essential picture one obtains from analytics and have only a limited impact on the signal 
significance relative to parton level. 

The pruning results have clear qualitative differences from the case of mMDT. In partic¬ 
ular at higher px we have to be more precise about the choice of z cu t due to the somewhat 
narrower peak in the signal significance. We can compare, as for mMDT, analytical results to 
those from Herwig-(— |- 2.7.0 , once again with a ±2% tolerance band shown in the bottom right 
figure of Fig. 16. We observe that within this small tolerance band the results are compatible 
though at higher px perhaps less so than for mMDT. 

In the original paper [20], the authors conclude that the optimal z cu t value for pruning is 
0.1 when using the C/A algorithm to cluster the initial jet, as we do here. This optimisation 
was performed at a moderate transverse momenta (100 — 500 GeV for W bosons) compared to 
this paper, however our results are consistent as we approach this region. For larger boosts, we 
observe that the optimal value choice for z cn t tends to slightly smaller values (z cu t ~ 0.075). 

We also present in Fig. 17 results for the signal significance of Y-pruning, again taking 
quark jets as background. Here we note firstly that analytics are again broadly in agreement 
with MC results for the shape of the signal significance as a function of z cu t- Secondly the 
peaks are quite broad and so choosing a somewhat non-optimal value of z cu t does not critically 
affect the significance. Furthermore, the optimal z cut does not depend strongly on px and is 
virtually constant over the limited px range studied. Lastly within a ±2 % tolerance band 
there is good agreement between full MC results and simple analytics on the optimal values 
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Figure 17. Y-pruning analytical signal significance from tree level signal and resummed background 
as a function of z cut compared to HerwigH—I- 2.7.0 at parton level and with hadronisation and MPI. 
Details of generation given in Fig. 15. 


of Amt • 

Hence for mMDT and Y-pruning and to a slightly smaller degree for pruning we find 
that, over the pp values we studied here, analytical results based on resununed calculations for 
QCD background and lowest order results for signals, with neglect of non-perturbative effects, 
capture the essential features of tagger performance, as reflected in the signal significance. An 
extension of our studies to lower pp values would be of interest in order to ascertain the further 
validity of the simple picture we have used for our analytical results and probe in more detail 
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the role of radiative corrections to the signal and that of non-perturbative contributions. We 
shall next examine the more involved case of optimal parameters for trimming. 

7.3 Trimming 

Here we carry out a similar analysis for trimming, but one now has to optimise two param¬ 
eters, -Rtrim and / cut . As performed for the pruning analysis, we use the analytic resummed 
expression for QCD jets given in Ref. [31]. The result for the background jet mass distri¬ 
bution consists of a region with single log behaviour (equivalent in structure to mMDT) for 
/cut Antrim < P < /cut which transitions into a double logarithmic growth in the background 
distribution for p < / cut i?j( rim . However, in contrast to mMDT and (Y-)pruning, FSR ra¬ 
diative corrections to the signal efficiency are crucial for optimisation. If one naively uses 
the tree level result given in Eq. (3.2), it follows that the optimum value for i? t rim tends to 
zero. This is because one can ensure that signal mass window is within the single logarithmic 
region by simply pushing the location of the double logarithmic transition (p = / cut R^ rim ) to 
small values of the QCD jet mass, thereby avoiding the double logarithmic peak. 

However, as shown in this paper, in the limit Rtrim —> 0, one encounters large logarith¬ 
mic corrections to the signal efficiency associated with final state radiation from the signal 
jet (see Eq. (3.11)). This puts a limit on how small one can reduce the trimming radius 
whilst maintaining reasonable signal mass resolution. Hence, we now include FSR radiative 
corrections to the signal efficiency by integrating the the expression given in Eq. (3.10) over 
z and adding this term to the Born level result Eq. (3.2). Including this radiative correction, 
along with the resummed QCD background, we can obtain analytical estimates for the signal 
significance. 

In Fig. 18 we show a 2D density plot for the signal significance with trimming over a range 
of -Rtrim and / cu t values using Monte Carlo at parton level (top) and with full hadronisation 
and underlying event (bottom) with a transverse momentum cut at 2 and 3 TeV left and 
right respectively. We overlay a black analytical contour representing the region in which 
the analytical signal significance is no more than ±2% away from the analytically derived 
peak value for Rtrim and f cu t- One can see that we have reasonable agreement between the 
simple analytical estimates and the Herwig++ 2.7.0 results at parton level. However, when 
one includes non-perturbative effects, we observe that contamination from underlying event 
significantly reduces the signal significance as Rtrim increases. 

We can use our simple analytical estimates to comment on the optimal values we observe 
from MC. Firstly, for optimal values of / cu t and Rtrim, one would expect the signal mass 
window to reside in the single logarithmic mMDT-like region of the background, hence we 
anticipate that the optimal parameters satisfy the constraint A > / cu tAntrim- This expectation 
is consistent with both the analytical contour (top right edge) and MC results both at parton 
and full hadron level. This background driven effect is manifest as a suppression in signal 
significance when the product / cu t Antrim becomes large (i.e top right of the contour plots). For 
example, at 3 TeV and fixed / cut = 0.1, one would analytically expect an optimal value for 
-Rtrim ;$ 0.13, whilst at / cu t = 0.05 one expects i? t rim < 0.19. These numbers are in agreement 
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Figure 18. 2D density plot showing the trimming signal significance as a function of Rtrim and 
/cut using HerwigH — h 2.7.0 for H —► bb jets with quark backgrounds with a minimum jet transverse 
momentum cut. The top panels are generated at parton level with transverse momenta 2 TeV and 
3 TeV left and right and the bottom panels include hadronisation and underlying event. The area 
inside the black contour represents the analytic prediction with FSR radiative corrections to the signal 
efficiency for optimal values within 2% of the analytic peak signal significance. 


with the analytical contour and MC results. Secondly, FSR corrections to the signal efficiency 
become significant in the region /? 2 rirn -C A, hence one would expect the optimal trimming 
radius to reside in the region i?trim <1 \/A. At 3 TeV this corresponds to Rtrim >0.04 and at 
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Figure 19. Trimming signal significance for Rtrim ~ \/A ~ 0.06 as a function of / cut generated at 
parton level and with hadronisation and MPI using Herwig++ 2.7.0 . 

2 TeV corresponds to -Rtrim > 0.06. This is consistent with the analytical contour and MC, 
where we observe a reduction in signal significance in the limit Rtrim \/A. 

We notice that, like mMDT and pruning, the signal significance is fairly insensitive to 
variations in / cut provided we choose Rtrim such that sj A// cut > Rtrim- However, the signal 
significance is subject to non-perturbative corrections which increase with Rtrim; and con¬ 
sequently one should favour the small Rtrim limit of the analytical optimal contour region 
Rtrim ~ \/A to minimise both signal FSR and non-perturbative corrections to the signal 
significance. It is thus of interest to choose Rj( rim ~ A and study the dependence of the signal 
significance on the choice of / cut as in Figs. 15, 16, 17. The results can be found in Fig. 19 
for Rtrim = 0.06 which is identical to A at 2 TeV and of order A for the other px values. 
With the given choice of Rtrim; reminiscent of the pruning radius, it is natural to compare 
the results to those for pruning reported in Fig. 16. One notes that even with a similar choice 
of radius there are differences between the two techniques. While for pruning the optimal 
z cu t decreases with increasing px the optimal value for trimming stays more constant. The 
peak signal significance itself increases with px in both cases. For a given px the behaviour 
as a function of f cn t is also different, especially at larger / cut . These differences originate in a 
number of sources: the difference in FSR corrections and their px dependence which is more 
pronounced for trimming, differences in the definitions of f cn t and z cu t and last but not least 
differences arising from QCD background jets with pruning and trimming (see Ref. [31]). In 
order to better understand the role for example of FSR effects, in the above context, we note 
that for pruning one can simply replace the signal efficiency by 1 — 2z cu t as we have done for 
our analytical studies of optimal parameter values in pruning. If one similarly uses 1 — 2/ cu t 
in computing the signal efficiency for trimming one observes that the result with Rj[ rim ~ A 
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is closer to that for pruning and optimal / cut values show a very similar trend with px to 
those for pruning. However for trimming px dependent FSR corrections cannot be neglected, 
especially at low px , and play an important role in pushing the optimal / cu t to smaller val¬ 
ues than would be obtained by turning off FSR effects. This is the main reason behind the 
relative insensitivity of optimal f cu t values seen with trimming, over the px range studied in 
Fig. 19. 

8 Conclusions 

In this article we have studied perturbative radiative corrections and non-perturbative effects 
for the case of signal jets, specifically for boosted Higgs production with H —> bb, with the 
application of jet substructure taggers. For the former we have carried out relatively simple 
analytical calculations both to assess the impact of ISR and FSR as well as to study its 
dependence on various parameters, such as a mass window of width SM on either side of the 
signal mass, the fat jet px , the mass of the resonance Mjj, and the parameters of the various 
taggers. To examine non-perturbative effects we have confined ourselves to MC studies. 

Our study was motivated by relatively recent calculations dedicated to the case of QCD 
background jets and in particular work presented in Ref. [31]. There it was noted that while 
taggers should in principle discriminate against jets from QCD background, the degree to 
which this happened and the impact on the background jet mass distribution was not always 
as desired. While taggers such as pruning, mMDT and trimming were essentially identical 
over a limited range in the normalised square jet mass p = Mj / (p?pR 2 ), significant differences 
in performance and behaviour were observed at small values of p, which especially at high 
px corresponded to masses in the signal mass region of interest. Likewise taggers should, in 
principle, not affect significantly signal jets, retaining them as far as possible. Additionally 
most taggers have a grooming element (via the / cu t/2/cut/-Yut criteria) that is responsible 
for clearing the jet of contamination from ISR/UE thereby helping in the reconstruction of 
sharper mass peaks. Here our aim was to carry out analytical and MC studies to investigate 
in detail the impact of taggers on signal especially with regard to the interplay between tagger 
parameters as well as kinematic cuts such as jet px , masses and mass windows. 

Our findings on the whole indicate that tagger performance is more robust for the case 
of signal jets than was apparent for QCD background. Most taggers are quite similar in their 
response to ISR and generally significantly ameliorate the loss of the signal efficiency seen for 
plain jet mass cuts, without these substructure techniques. An exception to this situation 
was the case of Y-splitter where the ISR and UE contamination resulted in a loss of signal 
efficiency identical to that seen for plain jets. 

Likewise for FSR, the radiative losses that one sees are on the whole modest for a rea¬ 
sonably wide range of tagger parameters. Here an interesting question opens up about the 
potential role for fixed-order calculations in the context of jet substructure studies. This is 
because one observes an absence of genuine logarithmic enhancements for sensibly chosen 
tagger parameter values. The signal efficiency, for the taggers studied here, ought then to be 
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Table 1 . A table summarising the logarithmic structure of radiative corrections to the signal efficiency 
for each tagger. For each tagger we show the coefficient of — as ^ F for ISR and FSR results in the 
small A and 2 C ut/j/cut//cut limit. We have defined e = 28 M/Mh as in the main text. The coefficient 
C 2 for the trimming FSR logarithm is given in Eq. (3.12). 


better described by exact calculations that incorporate hard gluon radiation or by combina¬ 
tions of matrix element corrections and parton showers than by the soft/collinear emissions 
encoded in pure parton showers. We carried out a comparison between an MC description 
of the signal efficiency and exact order a s results for various taggers, reported in appendix 
B. We find that we can reasonably adjust parameters such as the size of our mass window 
SM to obtain good agreement between the two descriptions. Such observations may also be 
useful beyond the immediate context of our work, in situations where differences in tagger 
performance could come from regions of phase space that are not under the control of a soft 
eikonal approximation. In these situations one would ideally want to combine resummed 
calculations, where necessary, with fixed-order calculations i.e. carry out matched resummed 
calculations. A summary of the results presented in this paper for the logarithmic structure 
of radiative corrections to the signal efficiency for each tagger are given in Table. 1. 

A development we have made here is the introduction of a combination of Y-splitter with 
trimming in an attempt to improve the response of Y-splitter to ISR/UE contamination. The 
main reason why we made this effort was due to the fact that we observed that Y-splitter was 
very effective at suppressing QCD background in the signal region. The resulting improvement 
of signal efficiency coupled with the fact that the background suppression from Y-splitter 
remained essentially intact after the use of trimming, meant that the combination of Y- 
splitter+trimming actually outperforms other taggers studied here, in particular, at high px- 
Our observation is in keeping with the general idea that suitably chosen tagger combinations 
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may prove to be superior discovery tools compared to currently proposed individual methods 
[64] . In fact it is now becoming increasingly common to use combinations of techniques such 
as N-subjettiness [62] with for instance mMDT in an effort to maximise tagger performance 
(see e.g. [65]). There is also much effort aimed at better understanding tagger correlations 
and we expect that our forthcoming analytical calculations for the case of Y-splitter with 
trimming will shed further light on some of these issues [63]. 

Lastly, we have carried out an analytical study of optimal parameter values for various 
taggers. Having observed modest radiative corrections to the signal we neglected these effects 
and found that analytical estimates, based on lowest order results for the signal and resummed 
calculations for QCD background, generally provide a good indicator of the dependence of 
signal significance on the tagger parameters. The analytical formulae which also do not 
include non-perturbative effects give rise to optimal values that are fairly compatible with 
those produced by full MC studies. This is encouraging from the point of view of robustness 
of the various methods considered since a dependence of optimal values on MC features 
(hadronisation models or MC tunes) are potentially not ideal. 

We note in closing that for other methods, such as N-subjettiness for example, there 
will also be a suppression of signal jets due to the fact that such observables directly restrict 
radiation from the signal prongs. Thus in those cases radiative corrections arising from 
soft/collinear emissions by signal prongs are highly significant as can be noted from Ref. [35] . 
We hope that our work taken together with studies of such observables will enable a more 
complete understanding of features of signal jets in the context of jet substructure studies 
and provide yet stronger foundations for future developments. 
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A Angular integration for FSR 

To work out the coefficient of the soft FSR we need to perform the angular and z integrals 
for the antenna pattern in Eq. (3.7) for trimming and likewise for all taggers. Generally, for 
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a single gluon emission, one has to evaluate the contribution from FSR emission outside two 
cones of radius r centred on the b and b quarks. The choice of r depends on the tagger in 
question, so after carrying out the angular integration, one can set r 2 as i?.^ rim for trimming, 
A for pruning and = A/(z(l — z)) for rnMDT and lastly integrate over z. 

One has to then evaluate the integral 


I = 


dQ 

2 tt (T 


1 ~ co s9 bb p,(n2 

cos6 bk )(l - cos0~ bk ) ^ bk 


-r 2 ) {el 


2 

bk 



(A.l) 


where we have now explicitly written the conditions for the gluon to be at an angle 6 2 > r 2 
wrt both hard partons. 10 The simplest way to evaluate the integral above is to first consider 
an integration over the entire solid angle and then to remove the contribution from inside two 
cones around the hard parton directions. We shall assume that the cones do not overlap, so 
shall consider r < 9 bb / 2. For larger r, as appropriate for mMDT where r = 9 bb , we perform 
a numerical calculation and find that our results agree with those of Rubin [34] . 

Therefore we write 

I = 4ll - k b ~ k- b (A.2) 

where I a n is the integration over the full solid angle and Ic h are the integrals inside the 
region corresponding to cones around b and b directions respectively. I a n can be evaluated by 
standard techniques and yields, after azimuthal averaging, the textbook result corresponding 
to angular ordering of soft emission. 


-'all 


(A.3) 


The contribution inside the cone around b , Ic b , can be evaluated as follows. Taking the 
b direction as the “z” axis we define the parton directions by the unit vectors: 


n b = (0,0,1), 

n b = (0,sin(%,cosi%), 

n k = (sin 6 b k sin cj>, sin 0 bk cos cos 6 b k) ■ (A.4) 


The in-cone subtraction term for Ci can then be written as 


Ic b = J (cos 6bk) 


i - cos e bb 


(1 - cos 6bk) (1 - COS 9 bb cos 9 b k - sin 9 b k sin 9 bb cos <j>) 


e (r 2 - 9 2 bk ) 


Integrating over the azimuthal angle 4> gives 


Ic b = / d (cos 9 b k) 


1 - cos 0 h z 


bb 


(1 — cos 9 b k ) | cos 9bk — cos 9 t 


3© P - el). 


(A.5) 


(A.6) 


bb I 


10 While we have retained, at this stage, the full angular antenna pattern for ease of comparison to standard 
formulae, we shall later take the small angle approximation to compute the final answer. 
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This term can be combined with the corresponding contribution (the first term) in / a n, and 
taking the small-angle approximation for cos# ~ 1 — 9 2 / 2! one obtains 


I = 


del I ^ - 


e li 




f e bb dd'l - 

+ ^r + b ^ b > 
Jr 2 e bk 


(A.7) 


where we have also included Ic h via the interchange b -H- b. The collinear divergence along 
each hard parton direction is cancelled by the in-cone contributions, leaving only a wide-angle 
contribution. Carrying out the angular integrations we get 


1 = 2 log 


% - ^ 


(A.8) 


which agrees with the result found by Rubin [34] written in terms of the variable r/ = fj r , for 
r) < In the collinear limit, r <C 9 bb , we get the result for trimming quoted in the main text 
and used in Eq. (3.10). To obtain the result for pruning we substitute 9 2 b = A/(z(l — z)) and 
r 2 = A, then carry out the z integral, which gives: 

1 = 2 L„ dzhl i z(i-z) ) ) = ^ + OM ’ (A ' 9) 

which corresponds to the result quoted for pruning in Eq. (4.4). 

For mMDT where r = 9 bb our calculation above, which assumed non-overlapping cones 
around the b and b. does not apply. For this purpose we have evaluated the angular integration 
numerically and for Mh/pt 1 be. when one can use the small-angle approximation, the 
result is ~ 0.646 as found by Rubin for the corresponding quantity J(l). 


B Fixed-order results vs parton showers for FSR corrections 

We have noted that FSR computed using the soft approximation gives numerically very small 
corrections to the leading-order results, for sensible choices of the mass window 5M, and 
the tagger parameters y cu t, z C ut> /cut and Rtrim- This of course means that such calculations 
are not a good guide to the actual tagger performance i.e. the signal efficiency, since they 
do not produce genuine logarithmic enhancements. One can expect instead that fixed-order 
calculations, with correct treatment of hard non-collinear radiation at order a s and beyond, 
will provide a better picture of the behaviour of taggers. Given that resummation effects are 
not likely to be significant it becomes of interest to compare signal efficiencies obtained with 
pure fixed-order calculations to those from MC generators. One may anticipate that precise 
order a s calculations give quite similar results to full MC parton showers, owing to the 
dominance of hard radiation and the consequent lack of importance of multiple soft/collinear 
emissions. 

To test this we ideally need to carry out an exact order a s calculation for the process 
H —> dbg. Such a calculation can be straightforwardly performed by taking the exact H —> bbg 
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SHERPA shower vs EVENT2: mMDT 

5M [GeV] 



SHERPA shower vs EVENT2: Pruning 


SM [GeV] 



Figure 20. Ratio for signal efficiency normalised to lowest-order result, with EVENT2 and Sherpa 
2.0.0, for e + e _ annihilation with virtual Z production and hadronic decay, where we consider a Z 
boson with a transverse boost to px = 3 TeV. 

matrix element and integrating over phase space after application of cuts corresponding to 
jet finding and tagging in various algorithms. While straightforward this exercise proves 
cumbersome and has in any case to be carried out with numerical integration. One may 
instead try to obtain the same information more economically by exploiting existing fixed- 
order codes. 

One of the most reliable and long-standing fixed-order programs available to us is the 
code EVENT2 [66] for e + e - annihilation. We can exploit this program by considering the 
process e + e _ -A Z° —> qq at lowest order and with an extra gluon emission i.e. up to order 
a s . One can perform a boost such that the Z° is produced with a large momentum along 
a given direction and then its decay products will, a significant fraction of the time, form a 
single fat jet. One can then apply the boosted object taggers to tag the Z boson imposing 
a mass window requirement SM, around Mz as we have done throughout this paper, for 
the Higgs boson. The situation is similar but not identical to the case of the Higgs we have 
thus far considered, due to the polarisation of the Z boson so that the matrix element for Z 
decay to quarks differs from Higgs case and efficiencies at tree-level and beyond are affected, 
giving for example a different dependence on z cut , y cut at lowest-order. Nevertheless all of our 
conclusions about radiative corrections apply to this case as well, including our findings about 
the logarithmic structure of FSR contributions, since these results follow from the radiation 
of a gluon from the qq pair, which is given by a process independent antenna pattern, that 
factorises from the process dependent lowest order decay of a scalar (i.e. Higgs) or a Z boson. 

Therefore in order to test our basic notion that fixed-order calculations should give a 
comparable FSR contribution to tagging efficiency, to that from MC event generators, it 
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should suffice to study results for boosted Z bosons from EVENT2 on the one hand and MC 
on the other. In order to minimise any process dependence one should choose precisely the 
same hard process for both the fixed-order and MC and hence we choose to study the virtual 
Z boson contribution in e + e _ —> qq events, with the hadronically decaying Z boosted to 3 
TeV (as in the EVENT2 case) with the MC generator Sherpa 2.0.0 shower [67]. 

We first study the signal efficiencies, normalised to the lowest order result, that are 
obtained with EVENT2 and Sherpa 2.0.0 for mMDT and pruning for y cu t = z cu t = 0.1. These 
are shown in Fig. 20 as a function of the mass window, where e = 2 5M/Mz as in the main 
text. 

A first observation is that there is a reasonable degree of qualitative and quantitative 
similarity between LO and shower estimates, over a wide range of mass windows, which 
establishes further our point about the essential perturbative stability of taggers against 
FSR corrections. The difference between the normalised signal efficiencies for SHERPA and 
EVENT2 are 2% or less when SM is greater than ~ 8 GeV or ~ 13 GeV for mMDT and 
pruning respectively. One should not in any case consider mass windows significantly lower 
than these values at high pp, in order to minimise NP hadronisation corrections from ISR. 
Differences start to become more marked for very low mass windows in particular for pruning, 
signalling the need for resummation and hadronisation corrections. We have also verified that 
hadronisation corrections have a minimal impact above the 8M values stated above and hence 
basically preserve the picture one obtains already at leading-order. One can also similarly 


SHERPA shower vs EVENT2 Difference: Pruning SHERPA shower vs EVENT2 Difference: Trimming 
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PT [GeV] 
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Figure 21. Contour plot showing the maximum percentage difference in signal efficiency between 
EVENT2 at order a s and Sherpa 2.0.0 final state shower both normalised to the lowest order result. 
In the left hand panel we apply pruning with different values for e and ^ cut with pp = 3 TeV. In the 
right hand panel we apply trimming with different values for \/A and i?trim with / cut = 0.1. 

study trimming where the choice of i?trim is additionally crucial to ensure that radiative 
corrections are minimised so that signal efficiency is maintained. Another way of making 
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this comparison is provided in Fig. 21 where we show the difference between EVENT2 and 
Sherpa 2.0.0 efficiencies (normalised to the lowest order result) as a function of z cut and e for 
pruning and as a function of i?trim and A for trimming. The values of SM, corresponding 
to the e = 25M/Mz values, are shown on the upper axis for pruning and values for px 
corresponding to A are shown for trimming. The blue shaded region in each case represents 
parameter values where the difference between the normalised signal efficiencies for Sherpa 
2.0.0 and EVENT2 are less than two percent while the green and pink regions correspond to 
less than five and ten percent respectively. From the plot for pruning one notes that there is 
a correlation between values of e and z cu t needed to minimise radiative corrections. As one 
goes up in z cu t, to stay within say the five percent zone, one has to correspondingly increase 
the size of the window. This is also in accordance with expectations from our simple analytics 
where one can expect large radiative corrections for e <C z cut . 

Also, for trimming, one may expect a correlation between the value of A and the value 
of -Rtrim required to minimise radiative degradation of mass. This is also reflected in Fig. 21 
where once again the green and pink shaded regions represent differences of 5 percent and 10 
percent respectively, for the normalised signal efficiencies. For px = 600 GeV, for example, 
choosing i?trim ~ 0.13 or larger gives less than 5 percent difference between leading order and 
shower descriptions. As one lowers i?trim radiative losses get progressively larger. 
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